Figure 1. Construction of a pC5 H6p WNV prM-M-E donor plasmid, pDS-2946-1-1.
Figure 2. Sequence of C5 H6p WNV prM-M-E C5 in pDS-2946-1-1.

=> C5R

1 TGAATGTTAA ATGTTATACT TTGGATGAAG CTATAAATAT GCATTGGAAA AATAATCCAT 

61 TTAAAGAAAG GATTCAAATA CTACAAAACC TAAGCGATAA TATGTTAACT AAGCTTATTC 

121 TTAACGACGC TTTAAATATA CACAAATAAA CATAATTTTT GTATAACCTA ACAAATAACT 

181 AAAACATAAA AATAATAAAA GGAAATGTAA TATCGTAATT ATTTTACTCA GGAATGGGGT 

241 TAAATATTTA TATCACGTGT ATATCTATAC TGTTATCGTA TACTCTTTAC AATTACTATT 

301 ACGAATATGC AAGAGATAAT AAGATTACGT ATTTAAGAGA ATCTTGTCAT GATAATTGGG 

361 TACGACATAG TGATAAATGC TATTTCGCAT CGTTACATAA AGTCAGTTGG AAAGATGGAT 

421 TTGACAGATG TAACTTAATA GGTGCAAAAA TGTTAAATAA CAGCATTCTA TCGGAAGATA 

481 GGATACCAGT TATATTATAC AAAAATCACT GGTTGGATAA AACAGATTCT GCAATATTCG 

541 TAAAAGATGA AGATTACTGC GAATTTGTAA ACTATGACAA TAAAAAGCCA TTTATCTCAA 

601 CGACATCGTG TAATTCTTCC ATGTTTTATG TATGTGTTTC AGATATTATG AGATTACTAT 

661 AAACTTTTTG TATACTTATA TTCCGTAAAC TATATTAATC ATGAAGAAAA TGAAAAAGTA 

721 TAGAAGCTGT TCACGAGCGG TTGTTGAAAA CAACAAAATT ATACATTCAA GATGGCTTAC 

781 ATATACGTCT GTGAGGCTAT CATGGATAAT GACAATGCAT CTCTAAATAG GTTTTTGGAC 

841 AATGGATTCG ACCCTAACAC GGAATATGGT ACTCTACAAT CTCCTCTTGA AATGGCTGTA 

901 ATGTTCAAGA ATACCGAGGC TATAAAAATC TTGATGAGGT ATGGAGCTAA ACCTGTAGTT 

961 ACTGAATGCA CAACTTCTTG TCTGCATGAT GCGGTGTTGA GAGACGACTA CAAAATAGTG 

1021 AAAGATCTGT TGAAGAATAA CTATGTAAAC AATGTTCTTT ACAGCGGAGG CTTTACTCCT 

1081 TTGTGTTTGG CAGCTTACCT TAACAAAGTT AATTTGGTTA AACTTCTATT GGCTCATTCG 

1141 GCGGATGTAG ATATTTCAAA CACGGATCGG TTAACTCCTC TACATATAGC CGTATCAAAT 

1201 AAAAATTTAA CAATGGTTAA ACTTCTATTG AACAAAGGTG CTGATACTGA CTTGCTGGAT

1261 AACATGGGAC GTACTCCTTT AATGATCGCT GTACAATCTG GAAATATTGA AATATGTAGC
1321 ACACTACTTA AAAAAAATAA AATGTCCAGA ACTGGGAAAA ATTGATCTTG CCAGCTGTAA 

1381 TTCATGGTAG AAAAGAAGTG CTCAGGCTAC TTTTCAACAA AGGAGCAGAT GTAAACTACA
1441 TCTTTGAAAG AAATGGAAAA TCATATACTG TTTTGGAATT GATTAAAGAA AGTTACTCTG 
1501 AGACACAAAA GAGGTAGCTG AAGTGGTACT CTCAAAAAGG TACGTGACTA ATTAGCTATA
1561 AAAAGGATCC GGGTTAATTA ATTAGTCATC AGGCAGGGCG AGAACGAGAC TATCTGCTCG 



=> H6p

1621 TTAATTAATT AGAGCTTCTT TATTCTATAC TTAAAAAGTG AAAATAAATA CAAAGGTTCT



1681 TGAGGGTTGT GTTAAATTGA AAGCGAGAAA TAATCATAAA TTATTTCATT ATCGCGATAT 

=> WNV capsid leader 
M T G I A V M I G L I A S V
1741 CCGTTAAGTT TGTATCGTAA TGACCGGAAT TGCAGTCATG ATTGGCCTGA TCGCCAGGGT

=> WNV prM 

G A V T L S N F Q G K V M M T V N A T D
1801 AGGAGCAGTT ACCCTCTCTA ACTTCCAAGG GAAGGTGATG ATGACGGTAA ATGCTACTGA

. V T D VITI PTA AGK NLCI VRA 
1861 CGTCACAGAT GTCATCACGA TTCCAACAGC TGCTGGAAAG AACCTATGCA TTGTCAGAGC 

. M D V GYMC DDT ITY ECPV L S A 
1921 AATGGATGTG GGATACATGT GCGATGATAC TATCACTTAT GAATGCCCAG TGCTGTCGGC 

G N D PEDI DCW CTK SAVY VRY 
1981 TGGTAATGAT CCAGAAGACA TCGACTGTTG GTGCACAAAG TCAGCAGTCT ACGTCAGGTA 

=> WNV M 

.GRC TKT R HSR RSR R S L T V Q T 
2041 TGGAAGATGC ACCAAGACAC GCCACTCAAG ACGCAGTCGG AGGTCACTGA CAGTGCAGAC

. H G E S T L A N K K G A W M D S T K A T
2101 ACACGGAGAA AGCACTCTAG CGAACAAGAA GGGGGCTTGG ATGGACAGCA CCAAGGCCAC 

. R Y L V K T E S W I L R N P G Y A L V A 
2161 AAGGTATTTG GTAAAAACAG AATCATGGAT CTTGAGGAAC CCTGGATATG CCCTGGTGGC 

.AVI G W M L G S N T M Q R V V F V V L 
2221 AGCCGTCATT GGTTGGATGC TTGGGAGCAA CACCATGCAG AGAGTTGTGT TTGTCGTGCT 

=> WNV E 

. L L L V A P A Y S F N C L G M S N R D F
2281 ATTGCTTTTG GTGGCCCCAG CTTACAGCTT CAACTGCCTT GGAATGAGCA ACAGAGACTT

.LEG VSGA TWV DLV LEGD SCV
2341 CTTGGAAGGA GTGTCTGGAG CAACATGGGT GGATTTGGTT CTCGAAGGCG ACAGCTGCGT

TIM SKDK PTI DVK MMNM EAA 
2401 GACTATCATG TCTAAGGACA AGCCTACCAT CGATGTGAAG ATGATGAATA TGGAGGCGGC

. NLA EVRS YCY LAT VSDL STK 

2461 CAACCTGGCA GAGGTCCGCA GTTATTGCTA TTTGGCTACC GTCAGCGATC TCTCCACCAA

.AAC PTMG EAH NDK RADP AFV
2521 AGCTGCGTGC CCGACCATGG GAGAAGCTCA CAATGACAAA CGTGCTGACC CAGCTTTTGT 

.CRQ GVV D RGW GNG CGLF GKG 

2581 GTGCAGACAA GGAGTGGTGG ACAGGGGCTG GGGCAACGGC TGCGGACTAT TTGGCAAAGG

.SID TCAK FAC STK AIGR TIL 
2641 AAGCATTGAC ACATGCGCCA AATTTGCCTG CTCTACCAAG GCAATAGGAA GAACCATCTT



mutated T5NT 

. KEN IKYE VAI FVH GPTT V E S 
2701 GAAAGAGAAT ATCAAGTACG AAGTGGCCAT CTTCGTGCAC GGACCAACTA CTGTGGAGTC

. H G N Y S T Q VGA T Q A G R F S I T P
2761 GCACGGAAAC TACTCCACAC AGGTTGGAGC CACTCAGGCA GGGAGATTCA GCATCACTCC

. A A P SYTL KLG EYG E V T V DCE 
2821 TGCGGCGCCT TCATACACAC TAAAGCTTGG AGAATATGGA GAGGTGACAG TGGACTGTGA 

. P R S GIDT NAY YVM TVG T KTF 
2881 ACCACGGTCA GGGATTGACA CCAATGCATA CTACGTGATG ACTGTTGGAA CAAAGACGTT

. L V H REWF MDL NLP WSSA GST

2941 CTTGGTCCAT CGTGAGTGGT TCATGGACCT CAACCTCCCT TGGAGCAGTG CTGGAAGTAC

.VWR NRET LME FEE PHAT KQS
3001 TGTGTGGAGG AACAGAGAGA CGTTAATGGA GTTTGAGGAA CCACACGCCA CGAAGCAGTC

.VIA LGSQ EGA LHQ ALAG AIP 

3061 TGTGATAGCA TTGGGCTCAC AAGAGGGAGC TCTGCATCAA GCTTTGGCTG GAGCCATTCC

V E F SSNT V K L TSG HLKC R V K
3121 TGTGGAATTT TCAAGGAACA CTGTCAAGTT GACGTCGGGT CATTTGAAGT GTAGAGTGAA 

.MEK LQLK GTT Y GV CSKA FKF 
3181 GATGGAAAAA TTGCAGTTGA AGGGAACAAC CTATGGCGTC TGTTCAAAGG CTTTCAAGTT 

.LGT PADT GHG TVV LELQ YTG 
3241 TCTTGGGACT CCCGCAGACA CAGGTCACGG CACTGTGGTG TTGGAATTGC AGTACACTGG 

. T D G PCKV PIS S A A SLND LTP 
3301 CACGGATGGA CCTTGCAAAG TTCCTATCTC GTCAGCGGCT TCATTGAACG ACCTAACGCC 

.VGR L V T V NPF VSV ATAN AKV
3361 AGTGGGCAGA TTGGTCACTG TCAACCCTTT TGTTTCAGTG GCCACGGCCA ACGCTAAGGT 

.LIE LEPP FGD SYI VV GR GEQ 
3421 CCTGATTGAA TTGGAACCAC CCTTTGGAGA CTCATACATA GTGGTGGGCA GAGGAGAACA 

. Q I N HHWH KSG SSI GKAF TTT 
3481 ACAGATCAAT CACCATTGGC ACAAGTCTGG AAGCAGCATT GGCAAAGCCT TTACAACCAC

. L K G AQRL A A L GDT AWDF GSV 
3541 CCTCAAAGGA GCGCAGAGAC TAGCCGCTCT AGGAGACACA GCTTGGGACT TTGGATCAGT

. G G V FTSV GKA VHQ VFGG AFR 
3601 TGGAGGGGTG TTCACCTCAG TTGGGAAGGC TGTCCATCAA GTGTTCGGAG GAGCATTCCG 

. S L F GGMS WIT QGL LGAL LLW 
3661 CTCACTGTTC GGAGGCATGT CCTGGATAAC GCAAGGATTG CTGGGGGCTC TCCTGTTGTG 

. M G I NARD RSI ALT FLAV GGV 
3721 GATGGGCATC AATGCTCGTG ATAGGTCCAT AGCTCTCACG TTTCTCGCAG TTGGAGGAGT

. L L F LSVN V H A
3781 TCTGCTCTTC CTCTCCGTGA ACGTGCACGC TTAATTTTTA TCTAGAATCG ATCCCGGGTT



=> C5L 

3841 TTTATGACTA GTTAATCACG GCCGCCTTAT AAAGATCTAA AATGCATAAT TTCTAAATAA 

3901 TGAAAAAAAG TACATCATGA GCAACGCGTT AGTATATTTT ACAATGGAGA TTAACGCTCT

3961 ATACCGTTCT ATGTTTATTG ATTCAGATGA TGTTTTAGAA AAGAAAGTTA TTGAATATGA

4021 AAACTTTAAT GAAGATGAAG ATGACGACGA TGATTATTGT TGTAAATCTG TTTTAGATGA

4081 AGAAGATGAC GCGCTAAAGT ATACTATGGT TACAAAGTAT AAGTCTATAC TACTAATGGC

4141 GACTTGTGCA AGAAGGTATA GTATAGTGAA AATGTTGTTA GATTATGATT ATGAAAAACC 

4201 AAATAAATCA GATCCATATC TAAAGGTATC TCCTTTGCAC ATAATTTCAT CTATTCCTAG

4261 TTTAGAATAG 
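
The "mutated T5NT" annotation in Figure 2 can be checked mechanically. T5NT (TTTTTNT) is generally described as an early transcription termination signal in poxviruses; the native E coding sequence shown in Figure 8 contains one (TTTTTGT), whereas the pox donor plasmid carries substitutions at that position. The following is a minimal Python sketch, assuming the two 18-nt windows are transcribed correctly from Figures 8 and 2. The names `NATIVE`, `MUTATED`, `has_t5nt` and `translate`, and the partial codon lookup, are illustrative and not part of the original figures.

```python
import re

# 18-nt windows covering the codons for A I F V H G (assumed transcription).
NATIVE = "GCCATTTTTGTCCATGGA"   # from Figure 8 (pVR1012 WNV prM-M-E)
MUTATED = "GCCATCTTCGTGCACGGA"  # from Figure 2 (pC5 H6p WNV prM-M-E)

# T5NT motif: five T's, any nucleotide, then T.
T5NT = re.compile(r"TTTTT[ACGT]T")

# Partial codon lookup: only the codons that occur in the two windows.
CODONS = {
    "GCC": "A", "ATT": "I", "ATC": "I", "TTT": "F", "TTC": "F",
    "GTC": "V", "GTG": "V", "CAT": "H", "CAC": "H", "GGA": "G",
}


def has_t5nt(seq):
    """Return True if the sequence contains a TTTTTNT motif."""
    return T5NT.search(seq) is not None


def translate(seq):
    """Translate an in-frame window using the partial lookup above."""
    return "".join(CODONS[seq[i:i + 3]] for i in range(0, len(seq), 3))


print("native has T5NT: ", has_t5nt(NATIVE))
print("mutated has T5NT:", has_t5nt(MUTATED))
print("same peptide:    ", translate(NATIVE) == translate(MUTATED))
```

Under these assumptions the substitutions remove the motif while leaving the encoded residues (A I F V H G) unchanged, which is consistent with the translation lines printed in both figures.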



Figure 3. Construction of a pF8 H6p WNV prM-M-E donor plasmid, pSL-5513-1-1-1.
Figure 4. Sequence of F8 H6p WNV prM-M-E F8 in pSL-5513-1-1-1.

=> F8R 

1 GACCCTTTAC AAGAATAAAA GAAGAAACAA CTGTGAAATA GTTTATAAAT GTAATTCGTA 

61 TGCAGAAAAC GATAATATAT TTTGGTATGA GAAATCTAAA GGAGACATAG TTTGTATAGA 

121 CATGCGCTCT TCCGATGAGA TATTCGATGC TTTTCTAATG TATCATATAG CTACAAGATA 

181 TGCCTATCAT GATGATGATA TATATCTACA AATAGTGTTA TATTATTCTA ATAATCAAAA 

241 TGTTATATCT TATATTACGA AAAATAAATA CGTTAAGTAT ATAAGAAATA AAACTAGAGA 

301 CGATATTCAT AAAGTAAAAA TATTAGCTCT AGAAGACTTT ACAACGGAAG AAATATATTG
361 TTGGATTAGT AATATATAAC AGCGTAGCTG CACGGTTTTG ATCATTTTCC AACAATATAA 
421 ACCAATGAAG GAGGACGACT CATCAAACAT AAATAACATT CACGGAAAAT ATTCAGTATC 

481 AGATTTATCA CAAGATGATT ATGTTATTGA ATGTATAGAC GGATCTTTTG ATTCGATCAA
541 GTATAGAGAT ATAAAGGTTA TAATAATGAA GAATAACGGT TACGTTAATT GTAGTAAATT 
601 ATGTAAAATG CGGAATAAAT ACTTTTCTAG ATGGTTGCGT CTTTCTACTT CTAAAGCATT 
661 ATTAGACATT TACAATAATA AGTCAGTAGA TAATGCTATT GTTAAAGTCT ATGGTAAAGG 
721 TAAGAAACTT ATTATAACAG GATTTTATCT CAAACAAAAT ATGATACGTT ATGTTATTGA 
781 GTGGATAGGG GATGATTTTA CAAACGATAT ATACAAAATG ATTAATTTCT ATAATGCGTT 
841 ATTCGGTAAC GATGAATTAA AAATAGTATC CTGTGAAAAC ACTCTATGCC CGTTTATAGA 
901 ACTTGGTAGA TGCTATTATG GTAAAAAATG TAAGTATATA CACGGAGATC AATGTGATAT 
961 CTGTGGTCTA TATATACTAC ACCCTACCGA TATTAACCAA CGAGTTTCTC ACAAGAAAAC 

1021 TTGTTTAGTA GATAGAGATT CTTTGATTGT GTTTAAAAGA AGTACCAGTA AAAAGTGTGG 

1081 CATATGCATA GAAGAAATAA ACAAAAAACA TATTTCCGAA CAGTATTTTG GAATTCTCCC 

1141 AAGTTGTAAA CATATTTTTT GCCTATCATG TATAAGACGT TGGGCAGATA CTACCAGAAA 

1201 TACAGATACT GAAAATACGT GTCCTGAATG TAGAATAGTT TTTCCTTTCA TAATACCCAG 

1261 TAGGTATTGG ATAGATAATA AATATGATAA AAAAATATTA TATAATAGAT ATAAGAAAAT

1321 GATTTTTACA AAAATACCTA TAAGAACAAT AAAAATATAA TTACATTTAC GGAAAATAGC 

1381 TGGTTTTAGT TTACCAACTT AGAGTAATTA TCATATTGAA TCTATATTGC TAATTAGCTA 

1441 ATAAAAACCC GGGTTAATTA ATTAGTCATC AGGCAGGGCG AGAACGAGAC TATCTGCTCG 

=> H6p 

1501 TTAATTAATT AGAGCTTCTT TATTCTATAC TTAAAAAGTG AAAATAAATA CAAAGGTTCT 



1561 TGAGGGTTGT GTTAAATTGA AAGCGAGAAA TAATCATAAA TTATTTCATT ATCGCGATAT



=> WNV capsid leader 
M T G I A V M I G L I A S V
1621 CCGTTAAGTT TGTATCGTAA TGACCGGAAT TGCAGTCATG ATTGGCCTGA TCGCCAGCGT

=> WNV prM 

. G A V TLS N FQG KVM MTVN ATD 
1681 AGGAGCGGTT ACCCTCTCTA ACTTCCAAGG GAAGGTGATG ATGACGGTAA ATGCTACTGA

. V T D VITI PTA AGK NLCI V R A 
1741 CGTCACAGAT GTCATCACGA TTCCAACAGC TGCTGGAAAG AACCTATGCA TTGTCAGAGC 

. M D V GYMC DDT I T Y E C P V LSA 
1801 AATGGATGTG GGATACATGT GCGATGATAC TATCACTTAT GAATGCCCAG TGCTGTCGGC

.GND PEDI DCW CTK SAVY VRY 
1861 TGGTAATGAT CCAGAAGACA TCGACTGTTG GTGCACAAAG TCAGCAGTCT ACGTCAGGTA 

=> WNV M 

. G R C TKTR H SR RSR R S L T V Q T
1921 TGGAAGATGC ACCAAGACAC GCCACTCAAG ACGCAGTCGG AGGTCACTGA CAGTGCAGAC

. H G E S T L A N K K G A W M D S T K A T
1981 ACACGGAGAA AGCACTCTAG CGAACAAGAA GGGGGCTTGG ATGGACAGCA CCAAGGCCAC 

R Y L V K T E S W I L R N P G Y A L V A 
2041 AAGGTATTTG GTAAAAACAG AATCATGGAT CTTGAGGAAC CCTGGATATG CCCTGGTGGC 

.AVI G W M L G S N T M Q R V V F V V L 
2101 AGCCGTCATT GGTTGGATGC TTGGGAGCAA CACCATGCAG AGAGTTGTGT TTGTCGTGCT 

=> WNV E

. L L L V A P A Y S F NCL GMSN RDF 
2161 ATTGCTTTTG GTGGCCCCAG CTTACAGCTT CAACTGCCTT GGAATGAGCA ACAGAGACTT 

.LEG VSGA TWV DLV LEGD SCV 
2221 CTTGGAAGGA GTGTCTGGAG CAACATGGGT GGATTTGGTT CTCGAAGGCG ACAGCTGCGT 

TIM SKDK PTI DVK MMNM EAA 
2281 GACTATCATG TCTAAGGACA AGCCTACCAT CGATGTGAAG ATGATGAATA TGGAGGCGGC

.NLA EVRS YCY L A T VSDL STK
2341 CAACCTGGCA GAGGTCCGCA GTTATTGCTA TTTGGCTACC GTCAGCGATC TCTCCACCAA 

. A A C PTMG EAH NDK RADP AFV 
2401 AGCTGCGTGC CCGACCATGG GAGAAGCTCA CAATGACAAA CGTGCTGACC CAGCTTTTGT 

.CRQ GVVD RGW GNG CGLF GKG 
2461 GTGCAGACAA GGAGTGGTGG ACAGGGGCTG GGGCAACGGC TGCGGACTAT TTGGCAAAGG 

.SID TCAK FAC STK AIGR TIL
2521 AAGCATTGAC ACATGCGCCA AATTTGCCTG CTCTACCAAG GCAATAGGAA GAACCATCTT

. KEN IKYE VAI FVH GPTT VES 
2581 GAAAGAGAAT ATCAAGTACG AAGTGGCCAT CTTCGTGCAC GGACCAACTA CTGTGGAGTC



.HGN YSTQ VGA TQA GRFS ITP 
2641 GCACGGAAAC TACTCCACAC AGGTTGGAGC CACTCAGGCA GGGAGATTCA GCATCACTCC 



. A A P SYTL KLG EYG E V T V DCE 
2701 TGCGGCGCCT TCATACACAC TAAAGCTTGG AGAATATGGA GAGGTGACAG TGGACTGTGA 

.PRS GIDT NAY Y V M TVGT K T F 
2761 ACCACGGTCA GGGATTGACA CCAATGCATA CTACGTGATG ACTGTTGGAA CAAAGACGTT

. L V H REWF MDL NLP WSSA GST
2821 CTTGGTCCAT CGTGAGTGGT TCATGGACCT CAACCTCCCT TGGAGCAGTG CTGGAAGTAC 

.VWR NRET LME FEE PHAT KQS 
2881 TGTGTGGAGG AACAGAGAGA CGTTAATGGA GTTTGAGGAA CCACACGCCA CGAAGCAGTC 

.VIA LGSQ EGA L H Q ALAG AIP 

2941 TGTGATAGCA TTGGGCTCAC AAGAGGGAGC TCTGCATCAA GCTTTGGCTG GAGCCATTCC

. V E F SSNT VKL TSG HLKC RVK

3001 TGTGGAATTT TCAAGCAACA CTGTCAAGTT GACGTCGGGT CATTTGAAGT GTAGAGTGAA

.MEK LQLK GTT Y G V CSKA FKF
3061 GATGGAAAAA TTGCAGTTGA AGGGAACAAC CTATGGCGTC TGTTCAAAGG CTTTCAAGTT 

. L G T P ADT GHG TVV LELQ YTG 
3121 TCTTGGGACT CCCGCAGACA CAGGTCACGG CACTGTGGTG TTGGAATTGC AGTACACTGG 

. T D G PCKV PIS SAA SLND LTP 
3181 CACGGATGGA CCTTGCAAAG TTCCTATCTC GTCAGCGGCT TCATTGAACG ACCTAACGCC 

. V G R LVTV NPF VSV ATAN AKV 
3241 AGTGGGCAGA TTGGTCACTG TCAACCCTTT TGTTTCAGTG GCCACGGCCA ACGCTAAGGT

. LIE LEPP FGD SYI VVGR GEQ 
3301 CCTGATTGAA TTGGAACCAC CCTTTGGAGA CTCATACATA GTGGTGGGCA GAGGAGAACA

. Q I N HHWH KSG SSI GKAF TTT 
3361 ACAGATCAAT CACCATTGGC ACAAGTCTGG AAGCAGCATT GGCAAAGCCT TTACAACCAC

. L K G AQRL A A L GDT AWDF GSV
3421 CCTCAAAGGA GCGCAGAGAC TAGCCGCTCT AGGAGACACA GCTTGGGACT TTGGATCAGT 

G G V FTSV GKA VHQ V FGG A F R 
3481 TGGAGGGGTG TTCACCTCAG TTGGGAAGGC TGTCCATCAA GTGTTCGGAG GAGCATTCCG

. S L F GGMS WIT QGL LGAL LLW 
3541 CTCACTGTTC GGAGGCATGT CCTGGATAAC GCAAGGATTG CTGGGGGCTC TCCTGTTGTG 

. M G I NARD RSI ALT FLAV GGV
3601 GATGGGCATC AATGCTCGTG ATAGGTCCAT AGCTCTCACG TTTCTCGCAG TTGGAGGAGT 

. L L F LSVN V H A 
3661 TCTGCTCTTC CTCTCCGTGA ACGTGCACGC TTAATTTTTA TCTAGAGTCG AGTTTTTATT 
Figure 5. Immunoblot analysis of the expression of WNV proteins from pox recombinants in CEFs.
Figure 6. Immunoblot analysis of the expression of WNV proteins from pox recombinants in BHK cells.
Lane 1: vFP2000.3.2.1.2.1 pellet
Lane 2: vCP2017.3.3 pellet
Lane 3: vCP2018.6.3.3.2 pellet
Lane 4: mock infected BHK pellet
Lane 5: pTriEx-WNV transfection pellet
Lane 6: mock transfected BHK pellet
Lane 7: vFP2000.3.2.1.2.1 supt
Lane 8: vCP2017.3.3 supt
Lane 9: vCP2018.6.3.3.2 supt
Lane 10: mock infected BHK supt
Lane 11: pTriEx-WNV transfection supt
Lane 12: mock transfection supt



vCP2017 = ALVAC WNV
vCP2018 = ALVAC-2 WNV
vFP2000 = Fowlpox WNV



Figure 7. Construction of pVR1012 WNV prM-M-E, pSL-5448-1-1.




pVR1012 WNV prM-M-E (no Kozak)

Oligos 7743.SL/7744.SL (Pst I - EcoR V, Kozak)

pVR1012 WNV prM-M-E



Figure 8. Nucleotide sequence and translation of the WNV prM-M-E region in pSL-5448-1-1, 
pVR1012 WNV prM-M-E. 



PstI Kozak => WNV capsid leader 

M G S T G I A V M I G L I A S V

1 |CTGCAG|CCGC CACCATGGGA TCAACGGGAA TTGCAGTCAT GATTGGCCTG ATCGCCAGCG

=> WNV prM 

. . G A V TLS NFQG KVM MTV N A T D
61 TAGGAGCAGT TACCCTCTCT AACTTCCAAG GGAAGGTGAT GATGACGGTA AATGCTACTG

. . V T D V I T IPTA AGK NLC I V R A
121 ACGTCACAGA TGTCATCACG ATTCCAACAG CTGCTGGAAA GAACCTATGC ATTGTCAGAG 

. . M D V GYM CDDT I T Y ECP V L S A 
181 CAATGGATGT GGGATACATG TGCGATGATA CTATCACTTA TGAATGCCCA GTGCTGTCGG 

G N D PED IDCW CTK S A V YVRY 
241 CTGGTAATGA TCCAGAAGAC ATCGACTGTT GGTGCACAAA GTCAGCAGTC TACGTCAGGT 


=> WNV M 

. . G R C TKT RHSR RSR RSL T V Q T
301 ATGGAAGATG CACCAAGACA CGCCACTCAA GACGCAGTCG GAGGTCACTG ACAGTGCAGA

. . H G E STL A N K K G A W M D S T K A T
361 CACACGGAGA AAGCACTCTA GCGAACAAGA AGGGGGCTTG GATGGACAGC ACCAAGGCCA 

. . R Y L V K T E S W I L R N P G Y ALVA
421 CAAGGTATTT GGTAAAAACA GAATCATGGA TCTTGAGGAA CCCTGGATAT GCCCTGGTGG 

..AVI G W M L G S N T M Q R V V F V V L 
481 CAGCCGTCAT TGGTTGGATG CTTGGGAGCA ACACCATGCA GAGAGTTGTG TTTGTCGTGC

=> WNV E 

. . L L L V A P A Y S F NCL GMS NRDF 
541 TATTGCTTTT GGTGGCCCCA GCTTACAGCT TCAACTGCCT TGGAATGAGC AACAGAGACT 

..LEG VSG ATWV D L V LEG DSCV 
601 TCTTGGAAGG AGTGTCTGGA GCAACATGGG TGGATTTGGT TCTCGAAGGC GACAGCTGCG 

Cla I

..TIM SKD KPT I D VK MMN MEAA

661 TGACTATCAT GTCTAAGGAC AAGCCTACC|A TCGAT|GTGAA GATGATGAAT ATGGAGGCGG

..NLA EVR SYCY LAT VSD LSTK 
721 CCAACCTGGC AGAGGTCCGC AGTTATTGCT ATTTGGCTAC CGTCAGCGAT CTCTCCACCA 

A A C PTM GEAH NDK RAD PAFV 
781 AAGCTGCGTG CCCGACCATG GGAGAAGCTC ACAATGACAA ACGTGCTGAC CCAGCTTTTG 

. . C R Q G V V DRGW GNG CGL FGKG
841 TGTGCAGACA AGGAGTGGTG GACAGGGGCT GGGGCAACGG CTGCGGACTA TTTGGCAAAG 

..SID TCA KFAC STK A I G R T I L
901 GAAGCATTGA CACATGCGCC AAATTTGCCT GCTCTACCAA GGCAATAGGA AGAACCATCT

..KEN IKY EVAI F V H GPT TVES 
961 TGAAAGAGAA TATCAAGTAC GAAGTGGCCA TTTTTGTCCA TGGACCAACT ACTGTGGAGT 

. . H G N YST QVGA T QA GRF SITP 
1021 CGCACGGAAA CTACTCCACA CAGGTTGGAG CCACTCAGGC AGGGAGATTC AGCATCACTC 
. . A A P SYT LKLG EYG E V T V D C E
1081 CTGCGGCGCC TTCATACACA CTAAAGCTTG GAGAATATGG AGAGGTGACA GTGGACTGTG 

..PRS GID TNAY YVM TVG TKTF
1141 AACCACGGTC AGGGATTGAC ACCAATGCAT ACTACGTGAT GACTGTTGGA ACAAAGACGT 

. . L V H REW FMDL N L P WSS A G S T
1201 TCTTGGTCCA TCGTGAGTGG TTCATGGACC TCAACCTCCC TTGGAGCAGT GCTGGAAGTA 

V W R NRE T L M E FEE PHA T K Q S
1261 CTGTGTGGAG GAACAGAGAG ACGTTAATGG AGTTTGAGGA ACCACACGCC ACGAAGCAGT 

..VIA LGS Q E G A LHQ ALA G A I P
1321 CTGTGATAGC ATTGGGCTCA CAAGAGGGAG CTCTGCATCA AGCTTTGGCT GGAGCCATTC

. . V E F SSN TVKL TSG HLK C R V K
1381 CTGTGGAATT TTCAAGCAAC ACTGTCAAGT TGACGTCGGG TCATTTGAAG TGTAGAGTGA

. . M E K LQL KGTT Y G V CSK A F K F
1441 AGATGGAAAA ATTGCAGTTG AAGGGAACAA CCTATGGCGT CTGTTCAAAG GCTTTCAAGT 

. . L G T PAD TGHG TVV LEL Q Y T G
1501 TTCTTGGGAC TCCCGCAGAC ACAGGTCACG GCACTGTGGT GTTGGAATTG CAGTACACTG 

. . T D G PCK VPIS SAA SLN DLTP
1561 GCACGGATGG ACCTTGCAAA GTTCCTATCT CGTCAGCGGC TTCATTGAAC GACCTAACGC 

. . V G R LVT VNPF VSV ATA N A K V
1621 CAGTGGGCAG ATTGGTCACT GTCAACCCTT TTGTTTCAGT GGCCACGGCC AACGCTAAGG 

..LIE LEP PFGD SYI VVG R G E Q
1681 TCCTGATTGA ATTGGAACCA CCCTTTGGAG ACTCATACAT AGTGGTGGGC AGAGGAGAAC

. . Q I N HHW HKSG SSI GKA F T T T
1741 AACAGATCAA TCACCATTGG CACAAGTCTG GAAGCAGCAT TGGCAAAGCC TTTACAACCA 

. . L K G AQR LAAL GDT AWD F G S V
1801 CCCTCAAAGG AGCGCAGAGA CTAGCCGCTC TAGGAGACAC AGCTTGGGAC TTTGGATCAG 

. . G G V FTS VGKA VHQ VFG G A F R
1861 TTGGAGGGGT GTTCACCTCA GTTGGGAAGG CTGTCCATCA AGTGTTCGGA GGAGCATTCC 

. . S L F GGM SWIT QGL LGA LLLW
1921 GCTCACTGTT CGGAGGCATG TCCTGGATAA CGCAAGGATT GCTGGGGGCT CTCCTGTTGT 

. . M G I NAR DRSI ALT FLA V G G V
1981 GGATGGGCAT CAATGCTCGT GATAGGTCCA TAGCTCTCAC GTTTCTCGCA GTTGGAGGAG 

Xba I

L L F LSV NVHA 
2 041 TTCTGCTCTT CCTCTCCGTG AACGTGCACG CTTAATTTTT A |TCTAGA| 
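
The annotations at the start of Figure 8 (Pst I site, Kozak sequence, capsid leader translation) can be reproduced from the first 60 nucleotides of the listing. The sketch below is illustrative only: it assumes the first line of Figure 8 is transcribed correctly, uses the standard genetic code, and introduces the hypothetical names `FIG8_START`, `CODON_TABLE` and `translate`.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# First 60 nt of Figure 8 (assumed transcription, box markers removed).
FIG8_START = "CTGCAGCCGCCACCATGGGATCAACGGGAATTGCAGTCATGATTGGCCTGATCGCCAGCG"


def translate(dna):
    """Translate complete codons from the start of dna, stopping at a stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


pst_i = FIG8_START.find("CTGCAG")          # Pst I recognition site
orf_start = FIG8_START.find("ATG")         # first ATG in the listing
kozak = FIG8_START[orf_start - 6:orf_start + 4]

print("Pst I site at index:", pst_i)
print("Kozak context:      ", kozak)
print("Leader peptide:     ", translate(FIG8_START[orf_start:]))
```

Under these assumptions the ATG sits in a GCCACCATGG context and the translated leader begins M G S T G I A V M I G L I A S, matching the amino acid line printed above the sequence.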
FIGURE 9

Sequence of pDS-2946-1-1, pC5 H6p WNV prM-M-E.

1 GCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGCAGCTGGCA 

61 CGACAGGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCAACGCAATTAATGTGAGTTAGCT 

121 CACTCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATGTTGTGTGGAAT

=> C5R 

181 TGTGAGCGGATAACAATTTCACACAGGAAACAGCTATGACCATGATTACGAATTGCGGCC 

241 GCAATTCTGAATGTTAAATGTTATACTTTGGATGAAGCTATAAATATGCATTGGAAAAAT 

301 AATCCATTTAAAGAAAGGATTCAAATACTACAAAACCTAAGCGATAATATGTTAACTAAG 

361 CTTATTCTTAACGACGCTTTAAATATACACAAATAAACATAATTTTTGTATAACCTAACA

421 AATAACTAAAACATAAAAATAATAAAAGGAAATGTAATATCGTAATTATTTTACTCAGGA 

481 ATGGGGTTAAATATTTATATCACGTGTATATCTATACTGTTATCGTATACTCTTTACAAT 

541 TACTATTACGAATATGCAAGAGATAATAAGATTACGTATTTAAGAGAATCTTGTCATGAT

601 AATTGGGTACGACATAGTGATAAATGCTATTTCGCATCGTTACATAAAGTCAGTTGGAAA 

661 GATGGATTTGACAGATGTAACTTAATAGGTGCAAAAATGTTAAATAACAGCATTCTATCG 

721 GAAGATAGGATACCAGTTATATTATACAAAAATCACTGGTTGGATAAAACAGATTCTGCA

781 ATATTCGTAAAAGATGAAGATTACTGCGAATTTGTAAACTATGACAATAAAAAGCCATTT 

841 ATCTCAACGACATCGTGTAATTCTTCCATGTTTTATGTATGTGTTTCAGATATTATGAGA

901 TTACTATAAACTTTTTGTATACTTATATTCCGTAAACTATATTAATCATGAAGAAAATGA 

961 AAAAGTATAGAAGCTGTTCACGAGCGGTTGTTGAAAACAACAAAATTATACATTCAAGAT 

1021 GGCTTACATATACGTCTGTGAGGCTATCATGGATAATGACAATGCATCTCTAAATAGGTT 

1081 TTTGGACAATGGATTCGACCCTAACACGGAATATGGTACTCTACAATCTCCTCTTGAAAT 

1141 GGCTGTAATGTTCAAGAATACCGAGGCTATAAAAATCTTGATGAGGTATGGAGCTAAACC

1201 TGTAGTTACTGAATGCACAACTTCTTGTCTGCATGATGCGGTGTTGAGAGACGACTACAA

1261 AATAGTGAAAGATCTGTTGAAGAATAACTATGTAAACAATGTTCTTTACAGCGGAGGCTT

1321 TACTCCTTTGTGTTTGGCAGCTTACCTTAACAAAGTTAATTTGGTTAAACTTCTATTGGC

1381 TCATTCGGCGGATGTAGATATTTCAAACACGGATCGGTTAACTCCTCTACATATAGCCGT

1441 ATCAAATAAAAATTTAACAATGGTTAAACTTCTATTGAACAAAGGTGCTGATACTGACTT 

1501 GCTGGATAACATGGGATGTACTCCTTTAATGATCGCTGTACAATCTGGAAATATTGAAAT

1561 ATGTAGCACACTACTTAAAAAAAATAAAATGTCCAGAACTGGGAAAAATTGATCTTGCCA



1621 GCTGTAATTCATGGTAGAAAAGAAGTGCTCAGGCTACTTTTCAACAAAGGAGCAGATGTA 

1681 AACTACATCTTTGAAAGAAATGGAAAATCATATACTGTTTTGGAATTGATTAAAGAAAGT 

1741 TACTCTGAGACACAAAAGAGGTAGCTGAAGTGGTACTCTCAAAGGTACGTGACTAATTAG

1801 CTATAAAAAGGATCCGGGTTAATTAATTAGTCATCAGGCAGGGCGAGAACGAGACTATCT

=> H6p 

1861 GCTCGTTAATTAATTAGAGC TTCTTTATTCTATACTTAAAAAGTGAAAATAAATACAAAG 

1921 GTTCTTGAGGGTTGTGTTAAATTGAAAGCGAGAAATAATCATAAATTATTTCATTATCGC 

=> WNV capsid leader 
MTGIAVMIGLIA 
1981 GATATCCGTTAAGTTTGTATCGTA ATGACCGGAATTGCAGTCATGATTGGCCTGATCGCC

=> WNV prM start 
SVGAVTLSNFQGKVMMTVNA 
2041 AGCGTAGGAGCGGTT ACCCTCTCTAACTTCCAAGGGAAGGTGATGATGACGGTAAATGCT

TDVTDVITI PTAAGKNLC IV 
2101 ACTGACGTCACAGATGTCATCACGATTCCAACAGCTGCTGGAAAGAACCTATGCATTGTC 

RAMDVGYMCDDTITYECPVL 
2161 AGAGCAATGGATGTGGGATACATGTGCGATGATACTATCACTTATGAATGCCCAGTGCTG 

SAGNDPEDIDCWCTKSAVYV 
2221 TCGGCTGGTAATGATCCAGAAGACATCGACTGTTGGTGCACAAAGTCAGCAGTCTACGTC 

=> WNV M start 

R YGRCTKTRHSRRS RRSLTV 
2281 AGGTATGGAAGATGC ACCAAGACACGCCACTCAAGACGCAGTCGGAGG TCACTGACAGTG 

QTHGESTLANKKGAWMDSTK

2341 CAGACACACGGAGAAAGCACTCTAGCGAACAAGAAGGGGGCTTGGATGGACAGCACCAAG

ATRYLVKTESWILRNPGYAL 
2401 GCCACAAGGTATTTGGTAAAAACAGAATCATGGATCTTGAGGAACCCTGGATATGCCCTG 

V A A VI G WML G S N T M Q R V V F V 
2461 GTGGCAGCCGTCATTGGTTGGATGCTTGGGAGCAACACCATGCAGAGAGTTGTGTTTGTC 

=> WNV E start 

V L L L L V A P A YSFNCLGMSNR 

2521 GTGCTATTGCTTTTGGTGGCCCCAGCTTACAGCTTCAACTGCCTTGGAATGAGCAACAGA

DFLEGVSGATWVDLVLEGDS
2581 GACTTCTTGGAAGGAGTGTCTGGAGCAACATGGGTGGATTTGGTTCTCGAAGGCGACAGC 

CVTIMS KDKPTID VKMMNME 
2641 TGCGTGACTATCATGTCTAAGGACAAGCCTACCATCGATGTGAAGATGATGAATATGGAG 
AANLAEVRSYCYLATVSDLS
2701 GCGGCCAACCTGGCAGAGGTCCGCAGTTATTGCTATTTGGCTACCGTCAGCGATCTCTCC 

TKAAC PT MGEAHNDKRADPA 
2761 ACCAAAGCTGCGTGCCCGACCATGGGAGAAGCTCACAATGACAAACGTGCTGACCCAGCT

FVCRQGVVDRGWGNGCGLFG
2821 TTTGTGTGCAGACAAGGAGTGGTGGACAGGGGCTGGGGCAACGGCTGCGGACTATTTGGC

KGSIDTCAKFACSTKAIGRT
2881 AAAGGAAGCATTGACACATGCGCCAAATTTGCCTGCTCTACCAAGGCAATAGGAAGAACC

mutated T5NT 

I L KEN I KYEVAI FVHGP TTV 

2941 ATCTTGAAAGAGAATATCAAGTACGAAGTGGCC ATCTTCGT GCACGGACCAACTACTGTG

ESHGNYSTQVGATQAGRFS I 

3001 GAGTCGCACGGAAACTACTCCACACAGGTTGGAGCCACTCAGGCAGGGAGATTCAGCATC

TPAAPSYTLKLGEYGEVTVD 
3061 ACTCCTGCGGCGCCTTCATACACACTAAAGCTTGGAGAATATGGAGAGGTGACAGTGGAC

CEPRSGIDTNAYYVMTVGTK
3121 TGTGAACCACGGTCAGGGATTGACACCAATGCATACTACGTGATGACTGTTGGAACAAAG

TFLVHREWFMDLNLPWSSAG 
3181 ACGTTCTTGGTCCATCGTGAGTGGTTCATGGACCTCAACCTCCCTTGGAGCAGTGCTGGA 

STVWRNRETLMEFEEPHATK 
3241 AGTACTGTGTGGAGGAACAGAGAGACGTTAATGGAGTTTGAGGAACCACACGCCACGAAG 

QSVIALGSQEGALHQALAGA 

3301 CAGTCTGTGATAGCATTGGGCTCACAAGAGGGAGCTCTGCATCAAGCTTTGGCTGGAGCC

I P VEFSSNTVKLTSGHLKCR 
3361 ATTCCTGTGGAATTTTCAAGCAACACTGTCAAGTTGACGTCGGGTCATTTGAAGTGTAGA 

VKMEKLQLKGTTYGVCSKAF 
3421 GTGAAGATGGAAAAATTGCAGTTGAAGGGAACAACCTATGGCGTCTGTTCAAAGGCTTTC 

KFLGT PADTG HGTVVLELQY 

3481 AAGTTTCTTGGGACTCCCGCAGACACAGGTCACGGCACTGTGGTGTTGGAATTGCAGTAC

TGTDGPCKVPISSAASLNDL 
3541 ACTGGCACGGATGGACCTTGCAAAGTTCCTATCTCGTCAGCGGGTTCATTGAACGACCTA 

TPVGRLVTVNPFVSVATANA 
3601 ACGCCAGTGGGCAGATTGGTCACTGTCAACCCTTTTGTTTCAGTGGCCACGGCCAACGCT 

KVLIELEPPFGDSY IVVGRG 
3661 AAGGTCCTGATTGAATTGGAACCACCCTTTGGAGACTCATACATAGTGGTGGGCAGAGGA

EQQINHHWHKSGSSIGKAFT
3721 GAACAACAGATCAATCACCATTGGCACAAGTCTGGAAGCAGCATTGGCAAAGCCTTTACA






TTLKGAQRLAALGDT AWDFG 
3781 ACCACCCTCAAAGGAGCGCAGAGACTAGCCGCTCTAGGAGACACAGCTTGGGACTTTGGA

SVGGVFTSVGKAVHQVFGGA
3841 TCAGTTGGAGGGGTGTTCACCTCAGTTGGGAAGGCTGTCCATCAAGTGTTCGGAGGAGCA

FRSLFGGMSWITQGLLGALL 
3901 TTCCGCTCACTGTTCGGAGGCATGTCCTGGATAACGCAAGGATTGCTGGGGGCTCTCCTG

LWMGINARDRSIALTFLAVG 

3961 TTGTGGATGGGCATCAATGCTCGTGATAGGTCCATAGCTCTCACGTTTCTCGCAGTTGGA

=> C5L 

GVLLFLSVNVHA* 

4021 GGAGTTCTGCTCTTCCTCTCCGTGAACGTGCACGCTTAATTTTTATCTAGAATCGATCCC

4081 GGGTTTTTATGACTAGTTAATCACGGCCGCTTATAAAGATCTAAAATGCATAATTTCTAA

4141 ATAATGAAAAAAAGTACATCATGAGCAACGCGTTAGTATATTTTACAATGGAGATTAACG 

4201 CTCTATACCGTTCTATGTTTATTGATTCAGATGATGTTTTAGAAAAGAAAGTTATTGAAT

4261 ATGAAAACTTTAATGAAGATGAAGATGACGACGATGATTATTGTTGTAAATCTGTTTTAG

4321 ATGAAGAAGATGACGCGCTAAAGTATACTATGGTTACAAAGTATAAGTCTATACTACTAA

4381 TGGCGACTTGTGCAAGAAGGTATAGTATAGTGAAAATGTTGTTAGATTATGATTATGAAA

4441 AACCAAATAAATCAGATCCATATCTAAAGGTATCTCCTTTGCACATAATTTCATCTATTC 

4501 CTAGTTTAGAATACCTGCAGCCAAGCTTGGCACTGGCCGTCGTTTTACAACGTCGTGACT

4561 GGGAAAACCCTGGCGTTACCCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCT

4621 GGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATG

4681 GCGAATGGCGCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCGGTATTTCACACCGCA 

4741 TATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGCCCCGACACC 

4801 CGCCAACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTACAGAC

4861 AAGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTTTCACCGTCATCACCGAAAC

4921 GCGCGAGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATGATAATAA

4981 TGGTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTT

5041 TATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAATGC 

5101 TTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTATTC 

5161 CCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAA 

5221 AAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCG 

5281 GTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAG

5341 TTCTGCTATGTGGCGCGGTATTATCCCGTATTGACGCCGGGCAAGAGCAACTCGGTCGCC

5401 GCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTA 

5461 CGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTG 

5521 CGGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTGCACA 

5581 ACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCCATAC 

5641 CAAACGACGAGCGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAACTAT 

5701 TAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCGG 

5761 ATAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATA 

5821 AATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTA 

5881 AGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAACGAA 

5941 ATAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAG 

6001 TTTACTCATATATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGG 

6061 TGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACT 

6121 GAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCG 

6181 TAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATC 

6241 AAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATA 

6301 CTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCCTA 

6361 CATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTC
6421 TTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGG 

6481 GGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTAC
6541 AGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGG 
6601 TAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGT 
6661 ATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCT 
6721 CGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGG
6781 CCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCTGTGGATA 
6841 ACCGTATTACCGCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGACCGAGCGCA 
6901 GCGAGTCAGTGAGCGAGGAAGCGGAAGA
FIGURE 10

Construction of pC5 H6p WNV prM-M-E donor plasmids with a truncated H6p +/- 
truncated WNV capsid leader sequence. 







pC5 H6p (t) WNV prM-M-E WNV-L (t) 



pC5 H6p (t) WNV prM-M-E 



FIGURE 11 

ALVAC WNV constructs with truncated H6p +/- truncated leader sequence.
Explanation of terms:

H6p (t) is the truncated H6p promoter, deleted between Nru I and the 3'-end.

WNV-L (t) refers to the truncated WNV capsid leader, which is missing the initiating Met
and therefore results in a shorter leader sequence.



H6p 5'-WNV sequence in actual vCP2017: 

=> H6p 

1861 GCTCGTTAATTAATTAGAGCTTCTTTATTCTATACTTAAAAAGTGAAAATAAATACAAAG 



Nru I

1921 gttcttgagggttgtgttaaattgaaagcgagaaataatcataaattatttcatta|tcgc|

=> WNV capsid leader
M T G I A V M I G L I A
1981 |GA|TATCCGTTAAGTTTGTATCGTAATGACCGGAATTGCAGTCATGATTGGCCTGATCGCC

=> WNV prM start
SVGAVTLSNFQGKVMMTVNA
2041 AGCGTAGGAGCGGTTACCCTCTCTAACTTCCAAGGGAAGGTGATGATGACGGTAAATGCT

TDVTDVITIPTAAGKNLCIV
2101 ACTGACGTCACAGATGTCATCACGATTCCAACAGCTGCTGGAAAGAACCTATGCATTGTC



PCR primers for H6p (t) WNV prM-M-E WNV-L (t): 



11344.SL

=> WNV capsid
Nru I T G I A V M I G L
ATTATCGCGAACCGGAATTGCAGTCATGATTGGCCTG

7616.SL



KPTIDVKM 
AAGCCTACCATCGATGTGAAGATG 
TTCGGATGGTAGCTACACTTCTAC 



Cla I 
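
The two strands shown for primer 7616.SL, and the Cla I site they span, can be verified with a short script. This is a minimal sketch, assuming the two lines above are the sense strand and its base-by-base complement written in the same left-to-right order. The figure does not state which strand is the synthesized oligonucleotide, so treating the antisense primer (read 5' to 3') as the reverse complement is an assumption, and the names `TOP`, `BOTTOM`, `complement` and `reverse_complement` are illustrative.

```python
# Watson-Crick base pairing for an unambiguous DNA alphabet.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

TOP = "AAGCCTACCATCGATGTGAAGATG"     # strand printed under K P T I D V K M
BOTTOM = "TTCGGATGGTAGCTACACTTCTAC"  # strand printed beneath it
CLA_I = "ATCGAT"                     # Cla I recognition site


def complement(seq):
    """Base-by-base complement, same orientation as the input."""
    return seq.translate(COMPLEMENT)


def reverse_complement(seq):
    """Complement read in the opposite direction (5' to 3' of the other strand)."""
    return complement(seq)[::-1]


print("bottom strand is complement of top:", complement(TOP) == BOTTOM)
print("Cla I site offset in top strand:   ", TOP.find(CLA_I))
print("antisense primer, 5' to 3':        ", reverse_complement(TOP))
```

Because ATCGAT is its own reverse complement, the Cla I site is present in both strands, which is what allows the PCR product to be cut at the same position as the parental plasmid.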



PCR primers for H6p (t) WNV prM-M-E: 



11345.SL

=> WNV capsid
Nru I M T G I A V M I G L
ATT ATCGCGA ATGACCGGAATTGCAGTCATGATTGGCCTG 
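
The difference between the two forward primers can be made explicit by translating each from its first ATG. This is a sketch under stated assumptions: the primer sequences are transcribed from the figure with spaces removed, the standard genetic code applies, and translation is assumed to begin at the first ATG downstream of the Nru I site. The figure itself says only that the truncated leader lacks the initiating Met and is therefore shorter. The names `PRIMER_11345`, `PRIMER_11344` and `leader_peptide` are illustrative.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

NRU_I = "TCGCGA"  # Nru I recognition site
PRIMER_11345 = "ATTATCGCGAATGACCGGAATTGCAGTCATGATTGGCCTG"  # full leader
PRIMER_11344 = "ATTATCGCGAACCGGAATTGCAGTCATGATTGGCCTG"     # truncated leader


def leader_peptide(primer):
    """Translate complete codons starting at the first ATG (assumed start site)."""
    start = primer.find("ATG")
    coding = primer[start:]
    return "".join(
        CODON_TABLE[coding[i:i + 3]] for i in range(0, len(coding) - 2, 3)
    )


for name, primer in (("11345.SL", PRIMER_11345), ("11344.SL", PRIMER_11344)):
    print(name, "Nru I present:", NRU_I in primer,
          "| leader:", leader_peptide(primer))
```

Under these assumptions 11345.SL encodes M T G I A V M I G L, while 11344.SL, which lacks the first ATG, would begin at the internal Met and encode M I G L, that is, a leader six residues shorter.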









FIGURE 12A 

agtagttcgc ctgtgtgagc tgacaaactt agtagtgttt gtgaggatta acaacaatta 
60 

acacagtgcg agctgtttct tagcacgaag atctcg atg tct aag aaa cca gga 
114 

Met Ser Lys Lys Pro Gly 
1 5 

ggg ccc ggc aag agc cgg gct gtc aat atg cta aaa cgc gga atg ccc
162 

Gly Pro Gly Lys Ser Arg Ala Val Asn Met Leu Lys Arg Gly Met Pro 
10 15 20 

cgc gtg ttg tcc ttg att gga ctg aag agg gct atg ttg agc ctg atc
210

Arg Val Leu Ser Leu Ile Gly Leu Lys Arg Ala Met Leu Ser Leu Ile
25 30 35 

gac ggc aag ggg cca ata cga ttt gtg ttg gct ctc ttg gcg ttc ttc
258

Asp Gly Lys Gly Pro Ile Arg Phe Val Leu Ala Leu Leu Ala Phe Phe

40 45 50

agg ttc aca gca att gct ccg acc cga gca gtg ctg gat cga tgg aga
306

Arg Phe Thr Ala Ile Ala Pro Thr Arg Ala Val Leu Asp Arg Trp Arg
55 60 65 70 

ggt gtg aac aaa caa aca gcg atg aaa cac ctt ctg agt ttt aag aag 
354 

Gly Val Asn Lys Gln Thr Ala Met Lys His Leu Leu Ser Phe Lys Lys
75 80 85 

gaa cta ggg acc ttg acc agt gct atc aat cgg cgg agc tca aaa caa
402

Glu Leu Gly Thr Leu Thr Ser Ala Ile Asn Arg Arg Ser Ser Lys Gln

90 95 100 

aag aaa aga gga gga aag acc gga att gca gtc atg att ggc ctg atc
450

Lys Lys Arg Gly Gly Lys Thr Gly Ile Ala Val Met Ile Gly Leu Ile
105 110 115 

gcc agc gta gga gca gtt acc ctc tct aac ttc caa ggg aag gtg atg
498

Ala Ser Val Gly Ala Val Thr Leu Ser Asn Phe Gln Gly Lys Val Met
120 125 130 

atg acg gta aat gct act gac gtc aca gat gtc atc acg att cca aca
546
FIGURE 12B

Met Thr Val Asn Ala Thr Asp Val Thr Asp Val Ile Thr Ile Pro Thr
135 140 145 150 

gct gct gga aag aac cta tgc att gtc aga gca atg gat gtg gga tac
594

Ala Ala Gly Lys Asn Leu Cys Ile Val Arg Ala Met Asp Val Gly Tyr

155 160 165 

atg tgc gat gat act atc act tat gaa tgc cca gtg ctg tcg gct ggt
642

Met Cys Asp Asp Thr Ile Thr Tyr Glu Cys Pro Val Leu Ser Ala Gly

170 175 180 

aat gat cca gaa gac atc gac tgt tgg tgc aca aag tca gca gtc tac
690

Asn Asp Pro Glu Asp Ile Asp Cys Trp Cys Thr Lys Ser Ala Val Tyr
185 190 195 

gtc agg tat gga aga tgc acc aag aca cgc cac tca aga cgc agt cgg
738 

Val Arg Tyr Gly Arg Cys Thr Lys Thr Arg His Ser Arg Arg Ser Arg 
200 205 210 

agg tca ctg aca gtg cag aca cac gga gaa agc act cta gcg aac aag
786

Arg Ser Leu Thr Val Gln Thr His Gly Glu Ser Thr Leu Ala Asn Lys
215 220 225 230 

aag ggg gct tgg atg gac agc acc aag gcc aca agg tat ttg gta aaa
834

Lys Gly Ala Trp Met Asp Ser Thr Lys Ala Thr Arg Tyr Leu Val Lys 
235 240 245 

aca gaa tca tgg atc ttg agg aac cct gga tat gcc ctg gtg gca gcc
882

Thr Glu Ser Trp Ile Leu Arg Asn Pro Gly Tyr Ala Leu Val Ala Ala
250 255 260 

gtc att ggt tgg atg ctt ggg agc aac acc atg cag aga gtt gtg ttt 
930 

Val Ile Gly Trp Met Leu Gly Ser Asn Thr Met Gln Arg Val Val Phe
265 270 275 

gtc gtg cta ttg ctt ttg gtg gcc cca gct tac agc ttc aac tgc ctt
978 

Val Val Leu Leu Leu Leu Val Ala Pro Ala Tyr Ser Phe Asn Cys Leu 
280 285 290 

gga atg agc aac aga gac ttc ttg gaa gga gtg tct gga gca aca tgg 
1026 
FIGURE 12C

Gly Met Ser Asn Arg Asp Phe Leu Glu Gly Val Ser Gly Ala Thr Trp 
295 300 305 310 

gtg gat ttg gtt ctc gaa ggc gac agc tgc gtg act atc atg tct aag
1074

Val Asp Leu Val Leu Glu Gly Asp Ser Cys Val Thr Ile Met Ser Lys
315 320 325 

gac aag cct acc atc gat gtg aag atg atg aat atg gag gcg gcc aac
1122

Asp Lys Pro Thr Ile Asp Val Lys Met Met Asn Met Glu Ala Ala Asn
330 335 340 

ctg gca gag gtc cgc agt tat tgc tat ttg get acc gtc age gat etc 
1170 

Leu Ala Glu Val Arg Ser Tyr Cys Tyr Leu Ala Thr Val Ser Asp Leu 
345 350 355 

tec acc aaa get gcg tgc ccg acc atg gga gaa get cac aat gac aaa 
1218 

Ser Thr Lys Ala Ala Cys Pro Thr Met Gly Glu Ala His Asn Asp Lys 

360 365 370 

cgt get gac cca get ttt gtg tgc aga caa gga gtg gtg gac agg ggc 
1266 

Arg Ala Asp Pro Ala Phe Val Cys Arg Gin Gly Val Val Asp Arg Gly 
375 380 385 390

tgg ggc aac ggc tgc gga cta ttt ggc aaa gga agc att gac aca tgc
1314 

Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser He Asp Thr Cys 
395 400 405 

gee aaa ttt gee tgc tct acc aag gca ata gga aga acc ate ttg aaa 
1362 

Ala Lys Phe Ala Cys Ser Thr Lys Ala He Gly Arg Thr He Leu Lys 
410 415 420 

gag aat ate aag tac gaa gtg gee att ttt gtc cat gga cca act act 
1410 

Glu Asn He Lys Tyr Glu Val Ala He Phe Val His Gly Pro Thr Thr 
425 430 435 

gtg gag teg cac gga aac tac tec aca cag gtt gga gee act cag gca 
1458 

Val Glu Ser His Gly Asn Tyr Ser Thr Gin Val Gly Ala Thr Gin Ala 
440 445 450 

ggg aga ttc agc atc act cct gcg gcg cct tca tac aca cta aag ctt
1506 
FIGURE 12D

Gly Arg Phe Ser Ile Thr Pro Ala Ala Pro Ser Tyr Thr Leu Lys Leu
455 460 465 470

gga gaa tat gga gag gtg aca gtg gac tgt gaa cca cgg tca ggg att
1554

Gly Glu Tyr Gly Glu Val Thr Val Asp Cys Glu Pro Arg Ser Gly Ile
475 480 485

gac acc aat gca tac tac gtg atg act gtt gga aca aag acg ttc ttg
1602

Asp Thr Asn Ala Tyr Tyr Val Met Thr Val Gly Thr Lys Thr Phe Leu
490 495 500

gtc cat cgt gag tgg ttc atg gac ctc aac ctc cct tgg agc agt gct
1650

Val His Arg Glu Trp Phe Met Asp Leu Asn Leu Pro Trp Ser Ser Ala
505 510 515

gga agt act gtg tgg agg aac aga gag acg tta atg gag ttt gag gaa
1698

Gly Ser Thr Val Trp Arg Asn Arg Glu Thr Leu Met Glu Phe Glu Glu
520 525 530

cca cac gcc acg aag cag tct gtg ata gca ttg ggc tca caa gag gga
1746

Pro His Ala Thr Lys Gln Ser Val Ile Ala Leu Gly Ser Gln Glu Gly
535 540 545 550

gct ctg cat caa gct ttg gct gga gcc att cct gtg gaa ttt tca agc
1794

Ala Leu His Gln Ala Leu Ala Gly Ala Ile Pro Val Glu Phe Ser Ser
555 560 565

aac act gtc aag ttg acg tcg ggt cat ttg aag tgt aga gtg aag atg
1842

Asn Thr Val Lys Leu Thr Ser Gly His Leu Lys Cys Arg Val Lys Met
570 575 580

gaa aaa ttg cag ttg aag gga aca acc tat ggc gtc tgt tca aag gct
1890

Glu Lys Leu Gln Leu Lys Gly Thr Thr Tyr Gly Val Cys Ser Lys Ala
585 590 595

ttc aag ttt ctt ggg act ccc gca gac aca ggt cac ggc act gtg gtg
1938

Phe Lys Phe Leu Gly Thr Pro Ala Asp Thr Gly His Gly Thr Val Val
600 605 610

ttg gaa ttg cag tac act ggc acg gat gga cct tgc aaa gtt cct atc
1986
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FIGURE 12E 

Leu Glu Leu Gin Tyr Thr Gly Thr Asp Gly Pro Cys Lys Val Pro lie 
615 620 625 630 

teg tea gtg get tea ttg aac gac eta acg cca gtg ggc aga ttg gtc 
2034 

Ser Ser Val Ala Ser Leu Asn Asp Leu Thr Pro Val Gly Arg Leu Val 
635 640 645 

act gtc aac cct ttt gtt tea gtg gee acg gee aac get aag gtc ctg 
2082 

Thr Val Asn Pro Phe Val Ser Val Ala Thr Ala Asn Ala Lys Val Leu 
650 655 660 

att gaa ttg gaa cca ccc ttt gga gac tea tac ata gtg gtg ggc aga 
2130 

lie Glu Leu Glu Pro Pro Phe Gly Asp Ser Tyr lie Val Val Gly Arg 
665 670 675 

gga gaa caa cag ate aat cac cat tgg cac aag tct gga age age att 
2178 

Gly Glu Gin Gin lie Asn His His Trp His Lys Ser Gly Ser Ser lie 
680 685 690 

ggc aaa gee ttt aca acc acc etc aaa gga gcg cag aga eta gee get 
2226 

Gly Lys Ala Phe Thr Thr Thr Leu Lys Gly Ala Gin Arg Leu Ala Ala 
695 700 705 710 

eta gga gac aca get tgg gac ttt gga tea gtt gga ggg gtg ttc acc 
2274 

Leu Gly Asp Thr Ala Trp Asp Phe Gly Ser Val Gly Gly Val Phe Thr 
715 720 725 

tea gtt ggg aag get gtc cat caa gtg ttc gga gga gca ttc cgc tea 
2322 

Ser Val Gly Lys Ala Val His Gin Val Phe Gly Gly Ala Phe Arg Ser 
730 735 740 

ctg ttc gga ggc atg tec tgg ata acg caa gga ttg ctg ggg get etc 
2370 

Leu Phe Gly Gly Met Ser Trp lie Thr Gin Gly Leu Leu Gly Ala Leu 
745 750 755 

ctg ttg tgg atg ggc ate aat get cgt gat agg tec ata get etc acg 
2418 

Leu Leu Trp Met Gly lie Asn Ala Arg Asp Arg Ser lie Ala Leu Thr 
760 765 770 

ttt etc gca gtt gga gga gtt ctg etc ttc etc tec gtg aac gtg cac 
2466
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FIGURE 12F 

Phe Leu Ala Val Gly Gly Val Leu Leu Phe Leu Ser Val Asn Val His 
775 780 785 790 

get gac act ggg tgt gec ata gac ate age egg caa gag ctg aga tgt 
2514 

Ala Asp Thr Gly Cys Ala lie Asp lie Ser Arg Gin Glu Leu Arg Cys 
795 800 805 

gga agt gga gtg ttc ata cac aat gat gtg gag get tgg atg gac egg 
2562 

Gly Ser Gly Val Phe lie His Asn Asp Val Glu Ala Trp Met Asp Arg 
810 815 820 

tac aag tat tac cct gaa acg cca caa ggc eta gee aag ate att cag 
2610 

Tyr Lys Tyr Tyr Pro Glu Thr Pro Gin Gly Leu Ala Lys lie lie Gin 
825 830 835 

aaa get cat aag gaa gga gtg tgc ggt eta cga tea gtt tec aga ctg 
2658 

Lys Ala His Lys Glu Gly Val Cys Gly Leu Arg Ser Val Ser Arg Leu 
840 845 850 

gag cat caa atg tgg gaa gca gtg aag gac gag ctg aac act ctt ttg 
2706 

Glu His Gin Met Trp Glu Ala Val Lys Asp Glu Leu Asn Thr Leu Leu 
855 860 865 870 

aag gag aat ggt gtg gac ctt agt gtc gtg gtt gag aaa cag gag gga 
2754 

Lys Glu Asn Gly Val Asp Leu Ser Val Val Val Glu Lys Gin Glu Gly 
875 880 885 

atg tac aag tea gca cct aaa cgc etc acc gee ace acg gaa aaa ttg 
2802 

Met Tyr Lys Ser Ala Pro Lys Arg Leu Thr Ala Thr Thr Glu Lys Leu 
890 895 900 

gaa att ggc tgg aag gec tgg gga aag agt att tta ttt gca cca gaa 
2850 

Glu lie Gly Trp Lys Ala Trp Gly Lys Ser lie Leu Phe Ala Pro Glu 
905 910 915 

etc gee aac aac acc ttt gtg gtt gat ggt ccg gag acc aag gaa tgt 
2898 

Leu Ala Asn Asn Thr Phe Val Val Asp Gly Pro Glu Thr Lys Glu Cys 
920 925 930 

ccg act cag aat cgc get tgg aat age tta gaa gtg gag gat ttt gga 
2946 
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FIGURE 12G 

Pro Thr Gin Asn Arg Ala Trp Asn Ser Leu Glu Val Glu Asp Phe Gly 
935 940 945 950 

ttt ggt etc acc age act egg atg ttc ctg aag gtc aga gag age aac 
2994 

Phe Gly Leu Thr Ser Thr Arg Met Phe Leu Lys Val Arg Glu Ser Asn 
955 960 965 

aca act gaa tgt gac teg aag ate att gga acg get gtc aag aac aac 
3042 

Thr Thr Glu Cys Asp Ser Lys lie lie Gly Thr Ala Val Lys Asn Asn 
970 975 980 

ttg gcg ate cac agt gac ctg tec tat tgg att gaa age agg etc aat 
3090 

Leu Ala lie His Ser Asp Leu Ser Tyr Trp lie Glu Ser Arg Leu Asn 
985 990 995 

gat acg tgg aag ctt gaa agg gca gtt ctg ggt gaa gtc aaa tea 
3135 

Asp Thr Trp Lys Leu Glu Arg Ala Val Leu Gly Glu Val Lys Ser 
1000 1005 1010 

tgt acg tgg cct gag acg cat acc ttg tgg ggc gat gga ate ctt 
3180 

Cys Thr Trp Pro Glu Thr His Thr Leu Trp Gly Asp Gly lie Leu 
1015 1020 1025 

gag agt gac ttg ata ata cca gtc aca ctg gcg gga cca cga age 
3225 

Glu Ser Asp Leu lie lie Pro Val Thr Leu Ala Gly Pro Arg Ser 
1030 1035 1040 

aat cac aat egg aga cct ggg tac aag aca caa aac cag ggc cca 
3270 

Asn His Asn Arg Arg Pro Gly Tyr Lys Thr Gin Asn Gin Gly Pro 
1045 1050 1055 

tgg gac gaa ggc cgg gta gag att gac ttc gat tac tgc cca gga
3315 

Trp Asp Glu Gly Arg Val Glu lie Asp Phe Asp Tyr Cys Pro Gly 
1060 1065 1070 

act acg gtc acc ctg agt gag age tgc gga cac cgt gga cct gee 
3360 

Thr Thr Val Thr Leu Ser Glu Ser Cys Gly His Arg Gly Pro Ala 
1075 1080 1085 

act cgc acc acc aca gag age gga aag ttg ata aca gat tgg tgc 
3405 
FIGURE 12H

Thr Arg Thr Thr Thr Glu Ser Gly Lys Leu Ile Thr Asp Trp Cys
1090 1095 1100

tgc agg agc tgc acc tta cca cca ctg cgc tac caa act gac agc
3450

Cys Arg Ser Cys Thr Leu Pro Pro Leu Arg Tyr Gln Thr Asp Ser
1105 1110 1115

ggc tgt tgg tat ggt atg gag atc aga cca cag aga cat gat gaa
3495

Gly Cys Trp Tyr Gly Met Glu Ile Arg Pro Gln Arg His Asp Glu
1120 1125 1130

aag acc ctc gtg cag tca caa gtg aat gct tat aat gct gat atg
3540

Lys Thr Leu Val Gln Ser Gln Val Asn Ala Tyr Asn Ala Asp Met
1135 1140 1145

att gac cct ttt cag ttg ggc ctt ctg gtc gtg ttc ttg gcc acc
3585

Ile Asp Pro Phe Gln Leu Gly Leu Leu Val Val Phe Leu Ala Thr
1150 1155 1160

cag gag gtc ctt cgc aag agg tgg aca gcc aag atc agc atg cca
3630

Gln Glu Val Leu Arg Lys Arg Trp Thr Ala Lys Ile Ser Met Pro
1165 1170 1175

gct ata ctg att gct ctg cta gtc ctg gtg ttt ggg ggc att act
3675

Ala Ile Leu Ile Ala Leu Leu Val Leu Val Phe Gly Gly Ile Thr
1180 1185 1190

tac act gat gtg tta cgc tat gtc atc ttg gtg ggg gca gct ttc
3720

Tyr Thr Asp Val Leu Arg Tyr Val Ile Leu Val Gly Ala Ala Phe
1195 1200 1205

gca gaa tct aat tcg gga gga gac gtg gta cac ttg gcg ctc atg
3765

Ala Glu Ser Asn Ser Gly Gly Asp Val Val His Leu Ala Leu Met
1210 1215 1220

gcg acc ttc aag ata caa cca gtg ttt atg gtg gca tcg ttt ctc
3810

Ala Thr Phe Lys Ile Gln Pro Val Phe Met Val Ala Ser Phe Leu
1225 1230 1235

aaa gcg aga tgg acc aac cag gag aac att ttg ttg atg ttg gcg
3855
FIGURE 12I

Lys Ala Arg Trp Thr Asn Gin Glu Asn lie Leu Leu Met Leu Ala 
1240 1245 1250 

get gtt ttc ttt caa atg get tat cac gat gec cgc caa att ctg 
3900 

Ala Val Phe Phe Gin Met Ala Tyr His Asp Ala Arg Gin lie Leu 
1255 1260 1265 

etc tgg gag ate cct gat gtg ttg aat tea ctg gcg gta get tgg 
3945 

Leu Trp Glu lie Pro Asp Val Leu Asn Ser Leu Ala Val Ala Trp 
1270 1275 1280 

atg ata ctg aga gee ata aca ttc aca acg aca tea aac gtg gtt 
3990 

Met He Leu Arg Ala He Thr Phe Thr Thr Thr Ser Asn Val Val 
1285 1290 1295 

gtt ccg ctg eta gee ctg eta aca ccc ggg ctg aga tgc ttg aat 
4035 

Val Pro Leu Leu Ala Leu Leu Thr Pro Gly Leu Arg Cys Leu Asn 
1300 1305 1310 

ctg gat gtg tac agg ata ctg ctg ttg atg gtc gga ata ggc age 
4080 

Leu Asp Val Tyr Arg He Leu Leu Leu Met Val Gly He Gly Ser 
1315 1320 1325 

ttg ate agg gag aag agg agt gca get gca aaa aag aaa gga gca 
4125 

Leu He Arg Glu Lys Arg Ser Ala Ala Ala Lys Lys Lys Gly Ala 
1330 1335 1340 

agt ctg eta tgc ttg get eta gee tea aca gga ctt ttc aac ccc 
4170 

Ser Leu Leu Cys Leu Ala Leu Ala Ser Thr Gly Leu Phe Asn Pro 
1345 1350 1355 

atg ate ctt get get gga ctg att gca tgt gat ccc aac cgt aaa 
4215 

Met He Leu Ala Ala Gly Leu He Ala Cys Asp Pro Asn Arg Lys 
1360 1365 1370 

cgc gga tgg ccc gca act gaa gtg atg aca get gtc ggc eta atg 
4260 

Arg Gly Trp Pro Ala Thr Glu Val Met Thr Ala Val Gly Leu Met 
1375 1380 1385 

ttt gee ate gtc gga ggg ctg gca gag ctt gac att gac tec atg 
4305 
FIGURE 12J

Phe Ala Ile Val Gly Gly Leu Ala Glu Leu Asp Ile Asp Ser Met
1390 1395 1400

gcc att cca atg act atc gcg ggg ctc atg ttt gct gct ttc gtg
4350

Ala Ile Pro Met Thr Ile Ala Gly Leu Met Phe Ala Ala Phe Val
1405 1410 1415

att tct ggg aaa tca aca gat atg tgg att gag aga acg gcg gac
4395

Ile Ser Gly Lys Ser Thr Asp Met Trp Ile Glu Arg Thr Ala Asp
1420 1425 1430

att tcc tgg gaa agt gat gca gaa att aca ggc tcg agc gaa aga
4440

Ile Ser Trp Glu Ser Asp Ala Glu Ile Thr Gly Ser Ser Glu Arg
1435 1440 1445

gtt gat gtg cgg ctt gat gat gat gga aac ttc cag ctc atg aat
4485

Val Asp Val Arg Leu Asp Asp Asp Gly Asn Phe Gln Leu Met Asn
1450 1455 1460

gat cca gga gca cct tgg aag ata tgg atg ctc aga atg gtc tgt
4530

Asp Pro Gly Ala Pro Trp Lys Ile Trp Met Leu Arg Met Val Cys
1465 1470 1475

ctc gcg att agt gcg tac acc ccc tgg gca atc ttg ccc tca gta
4575

Leu Ala Ile Ser Ala Tyr Thr Pro Trp Ala Ile Leu Pro Ser Val
1480 1485 1490

gtt gga ttt tgg ata act ctc caa tac aca aag aga gga ggc gtg
4620

Val Gly Phe Trp Ile Thr Leu Gln Tyr Thr Lys Arg Gly Gly Val
1495 1500 1505

ttg tgg gac act ccc tca cca aag gag tac aaa aag ggg gac acg
4665

Leu Trp Asp Thr Pro Ser Pro Lys Glu Tyr Lys Lys Gly Asp Thr
1510 1515 1520

acc acc ggc gtc tac agg atc atg act cgt ggg ctg ctc ggc agt
4710

Thr Thr Gly Val Tyr Arg Ile Met Thr Arg Gly Leu Leu Gly Ser
1525 1530 1535

tat caa gca gga gcg ggc gtg atg gtt gaa ggt gtt ttc cac acc
4755
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FIGURE 12K 

Tyr Gin Ala Gly Ala Gly Val Met Val Glu Gly Val Phe His Thr 
1540 1545 1550 

ctt tgg cat aca aca aaa gga gcc get ttg atg age gga gag ggc 
4800 

Leu Trp His Thr Thr Lys Gly Ala Ala Leu Met Ser Gly Glu Gly 
1555 1560 1565 

cgc ctg gac cca tac tgg ggc agt gtc aag gag gat cga ctt tgt 
4845 

Arg Leu Asp Pro Tyr Trp Gly Ser Val Lys Glu Asp Arg Leu Cys 
1570 1575 1580 

tac gga gga ccc tgg aaa ttg cag cac aag tgg aac ggg cag gat 
4890 

Tyr Gly Gly Pro Trp Lys Leu Gin His Lys Trp Asn Gly Gin Asp 
1585 1590 1595 

gag gtg cag atg att gtg gtg gaa cct ggc aag aac gtt aag aac
4935 

Glu Val Gin Met He Val Val Glu Pro Gly Lys Asn Val Lys Asn 
1600 1605 1610 

gtc cag acg aaa cca ggg gtg ttc aaa aca cct gaa gga gaa ate 
4980 

Val Gin Thr Lys Pro Gly Val Phe Lys Thr Pro Glu Gly Glu He 
1615 1620 1625

ggg gcc gtg act ttg gac ttc ccc act gga aca tca ggc tca cca
5025 

Gly Ala Val Thr Leu Asp Phe Pro Thr Gly Thr Ser Gly Ser Pro 

1630 1635 1640 

ata gtg gac aaa aac ggt gat gtg att ggg ctt tat ggc aat gga 
5070 

He Val Asp Lys Asn Gly Asp Val He Gly Leu Tyr Gly Asn Gly 
1645 1650 1655 

gtc ata atg ccc aac ggc tca tac ata age gcg ata gtg cag ggt 
5115 

Val He Met Pro Asn Gly Ser Tyr He Ser Ala He Val Gin Gly 
1660 1665 1670 

gaa agg atg gat gag cca ate cca gcc gga ttc gaa cct gag atg 
5160 

Glu Arg Met Asp Glu Pro He Pro Ala Gly Phe Glu Pro Glu Met 
1675 1680 1685 

ctg agg aaa aaa cag ate act gta ctg gat etc cat ccc ggc gcc 
5205 



00150681.DOC 



FIGURE 12L 

Leu Arg Lys Lys Gin lie Thr Val Leu Asp Leu His Pro Gly Ala 
1690 1695 1700 

ggt aaa aca agg agg att ctg cca cag ate ate aaa gag gec ata 
5250 

Gly Lys Thr Arg Arg lie Leu Pro Gin lie lie Lys Glu Ala lie 
1705 1710 1715 

aac aga aga ctg aga aca gec gtg eta gca cca acc agg gtt gtg 
5295 

Asn Arg Arg Leu Arg Thr Ala Val Leu Ala Pro Thr Arg Val Val 
1720 1725 1730 

get get gag atg get gaa gca ctg aga gga ctg ccc ate egg tac 
5340 

Ala Ala Glu Met Ala Glu Ala Leu Arg Gly Leu Pro lie Arg Tyr 
1735 1740 1745

cag aca tec gca gtg ccc aga gaa cat aat gga aat gag att gtt 
5385 

Gin Thr Ser Ala Val Pro Arg Glu His Asn Gly Asn Glu lie Val 
1750 1755 1760

gat gtc atg tgt cat get acc etc acc cac agg ctg atg tct cct 
5430 

Asp Val Met Cys His Ala Thr Leu Thr His Arg Leu Met Ser Pro 
1765 1770 1775 

cac agg gtg ccg aac tac aac ctg ttc gtg atg gat gag get cat 
5475 

His Arg Val Pro Asn Tyr Asn Leu Phe Val Met Asp Glu Ala His
1780 1785 1790

ttc acc gac cca gct agc att gca gca aga ggt tac att tcc aca
5520 

Phe Thr Asp Pro Ala Ser lie Ala Ala Arg Gly Tyr lie Ser Thr 
1795 1800 1805 

aag gtc gag eta ggg gag gcg gcg gca ata ttc atg aca gee acc 
5565 

Lys Val Glu Leu Gly Glu Ala Ala Ala He Phe Met Thr Ala Thr 
1810 1815 1820 

cca cca ggc act tea gat cca ttc cca gag tec aat tea cca att 
5610 

Pro Pro Gly Thr Ser Asp Pro Phe Pro Glu Ser Asn Ser Pro He 
1825 1830 1835 

tec gac tta cag act gag ate ccg gat cga get tgg aac tct gga 
5655 
FIGURE 12M

Ser Asp Leu Gln Thr Glu Ile Pro Asp Arg Ala Trp Asn Ser Gly
1840 1845 1850

tac gaa tgg atc aca gaa tac acc ggg aag acg gtt tgg ttt gtg
5700

Tyr Glu Trp Ile Thr Glu Tyr Thr Gly Lys Thr Val Trp Phe Val
1855 1860 1865

cct agt gtc aag atg ggg aat gag att gcc ctt tgc cta caa cgt
5745

Pro Ser Val Lys Met Gly Asn Glu Ile Ala Leu Cys Leu Gln Arg
1870 1875 1880

gct gga aag aaa gta gtc caa ttg aac aga aag tcg tac gag acg
5790

Ala Gly Lys Lys Val Val Gln Leu Asn Arg Lys Ser Tyr Glu Thr
1885 1890 1895

gag tac cca aaa tgt aag aac gat gat tgg gac ttt gtt atc aca
5835

Glu Tyr Pro Lys Cys Lys Asn Asp Asp Trp Asp Phe Val Ile Thr
1900 1905 1910

aca gac ata tct gaa atg ggg gct aac ttc aag gcg agc agg gtg
5880

Thr Asp Ile Ser Glu Met Gly Ala Asn Phe Lys Ala Ser Arg Val
1915 1920 1925

att gac agc cgg aag agt gtg aaa cca acc atc ata aca gaa gga
5925

Ile Asp Ser Arg Lys Ser Val Lys Pro Thr Ile Ile Thr Glu Gly
1930 1935 1940

gaa ggg aga gtg atc ctg gga gaa cca tct gca gtg aca gca gct
5970

Glu Gly Arg Val Ile Leu Gly Glu Pro Ser Ala Val Thr Ala Ala
1945 1950 1955

agt gcc gcc cag aga cgt gga cgt atc ggt aga aat ccg tcg caa
6015

Ser Ala Ala Gln Arg Arg Gly Arg Ile Gly Arg Asn Pro Ser Gln
1960 1965 1970

gtt ggt gat gag tac tgt tat ggg ggg cac acg aat gaa gac gac
6060

Val Gly Asp Glu Tyr Cys Tyr Gly Gly His Thr Asn Glu Asp Asp
1975 1980 1985

tcg aac ttc gcc cat tgg act gag gca cga atc atg ctg gac aac
6105
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FIGURE 12N 

Ser Asn Phe Ala His Trp Thr Glu Ala Arg lie Met Leu Asp Asn 
1990 1995 2000 

ate aac atg cca aac gga ctg ate get caa ttc tac caa cca gag 
6150 

lie Asn Met Pro Asn Gly Leu lie Ala Gin Phe Tyr Gin Pro Glu 

2005 2010 2015 

cgt gag aag gta tat acc atg gat ggg gaa tac egg etc aga gga 
6195 

Arg Glu Lys Val Tyr Thr Met Asp Gly Glu Tyr Arg Leu Arg Gly 
2020 2025 2030 

gaa gag aga aaa aac ttt ctg gaa ctg ttg agg act gca gat ctg 
6240 

Glu Glu Arg Lys Asn Phe Leu Glu Leu Leu Arg Thr Ala Asp Leu 
2035 2040 2045

cca gtt tgg ctg get tac aag gtt gca gcg get gga gtg tea tac 
6285 

Pro Val Trp Leu Ala Tyr Lys Val Ala Ala Ala Gly Val Ser Tyr 
2050 2055 2060 

cac gac egg agg tgg tgc ttt gat ggt cct agg aca aac aca att 
6330 

His Asp Arg Arg Trp Cys Phe Asp Gly Pro Arg Thr Asn Thr lie 

2065 2070 2075 

tta gaa gac aac aac gaa gtg gaa gtc ate acg aag ctt ggt gaa 
6375

Leu Glu Asp Asn Asn Glu Val Glu Val lie Thr Lys Leu Gly Glu 
2080 2085 2090 

agg aag att ctg agg ccg cgc tgg att gac gec agg gtg tac teg 
6420 

Arg Lys lie Leu Arg Pro Arg Trp lie Asp Ala Arg Val Tyr Ser 
2095 2100 2105 

gat cac cag gca eta aag gcg ttc aag gac ttc gee teg gga aaa 
6465 

Asp His Gin Ala Leu Lys Ala Phe Lys Asp Phe Ala Ser Gly Lys 
2110 2115 2120 

cgt tct cag ata ggg etc att gag gtt ctg gga aag atg cct gag 
6510 

Arg Ser Gin lie Gly Leu lie Glu Val Leu Gly Lys Met Pro Glu 
2125 2130 2135 

cac ttc atg ggg aag aca tgg gaa gca ctt gac acc atg tac gtt 
6555 
FIGURE 12O

His Phe Met Gly Lys Thr Trp Glu Ala Leu Asp Thr Met Tyr Val 
2140 2145 2150 

gtg gcc act gca gag aaa gga gga aga get cac aga atg gec ctg 
6600 

Val Ala Thr Ala Glu Lys Gly Gly Arg Ala His Arg Met Ala Leu 
2155 2160 2165 

gag gaa ctg cca gat get ctt cag aca att gcc ttg att gcc tta 
6645 

Glu Glu Leu Pro Asp Ala Leu Gin Thr lie Ala Leu lie Ala Leu 
2170 2175 2180 

ttg agt gtg atg acc atg gga gta ttc ttc etc etc atg cag egg 
6690 

Leu Ser Val Met Thr Met Gly Val Phe Phe Leu Leu Met Gin Arg 
2185 2190 2195 

aag ggc att gga aag ata ggt ttg gga ggc get gtc ttg gga gtc 
6735 

Lys Gly He Gly Lys He Gly Leu Gly Gly Ala Val Leu Gly Val 
2200 2205 2210 

gcg acc ttt ttc tgt tgg atg get gaa gtt cca gga acg aag ate 
6780 

Ala Thr Phe Phe Cys Trp Met Ala Glu Val Pro Gly Thr Lys He 
2215 2220 2225 

gcc gga atg ttg ctg etc tec ctt etc ttg atg att gtg eta att 
6825 

Ala Gly Met Leu Leu Leu Ser Leu Leu Leu Met He Val Leu He 
2230 2235 2240 

cct gag cca gag aag caa cgt teg cag aca gac aac cag eta gcc 
6870 

Pro Glu Pro Glu Lys Gin Arg Ser Gin Thr Asp Asn Gin Leu Ala 
2245 2250 2255 

gtg ttc ctg att tgt gtc atg acc ctt gtg age gca gtg gca gcc 
6915 

Val Phe Leu He Cys Val Met Thr Leu Val Ser Ala Val Ala Ala 
2260 2265 2270 

aac gag atg ggt tgg eta gat aag acc aag agt gac ata age agt 
6960 

Asn Glu Met Gly Trp Leu Asp Lys Thr Lys Ser Asp lie Ser Ser 
2275 2280 2285 

ttg ttt ggg caa aga att gag gtc aag gag aat ttc age atg gga 
7005 






FIGURE 12P 

Leu Phe Gly Gin Arg lie Glu Val Lys Glu Asn Phe Ser Met Gly 
2290 2295 2300 

gag ttt ctt ttg gac ttg agg ccg gca aca gcc tgg tea ctg tac 
7050 

Glu Phe Leu Leu Asp Leu Arg Pro Ala Thr Ala Trp Ser Leu Tyr 
2305 2310 2315 

get gtg aca aca gcg gtc etc act cca ctg eta aag cat ttg ate 
7095 

Ala Val Thr Thr Ala Val Leu Thr Pro Leu Leu Lys His Leu lie 

2320 2325 2330 

acg tea gat tac ate aac ace tea ttg acc tea ata aac gtt cag 
7140 

Thr Ser Asp Tyr lie Asn Thr Ser Leu Thr Ser lie Asn Val Gin 
2335 2340 2345 

gca agt gca eta ttc aca etc gcg cga ggc ttc ccc ttc gtc gat 
7185 

Ala Ser Ala Leu Phe Thr Leu Ala Arg Gly Phe Pro Phe Val Asp 
2350 2355 2360 

gtt gga gtg teg get etc ctg eta gca gcc gga tgc tgg gga caa 
7230 

Val Gly Val Ser Ala Leu Leu Leu Ala Ala Gly Cys Trp Gly Gin 
2365 2370 2375 

gtc acc etc acc gtt acg gta aca gcg gca aca etc ctt ttt tgc 
7275 

Val Thr Leu Thr Val Thr Val Thr Ala Ala Thr Leu Leu Phe Cys 
2380 2385 2390 

cac tat gcc tac atg gtt ccc ggt tgg caa get gag gca atg cgc 
7320 

His Tyr Ala Tyr Met Val Pro Gly Trp Gin Ala Glu Ala Met Arg 
2395 2400 2405 

tea gcc cag egg egg aca gcg gcc gga ate atg aag aac get gta 
7365 

Ser Ala Gin Arg Arg Thr Ala Ala Gly lie Met Lys Asn Ala Val 
2410 2415 2420 

gtg gat ggc ate gtg gcc acg gac gtc cca gaa tta gag cgc acc 
7410 

Val Asp Gly He Val Ala Thr Asp Val Pro Glu Leu Glu Arg Thr 
2425 2430 2435 

aca ccc ate atg cag aag aaa gtt gga cag ate atg ctg ate ttg 
7455 
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FIGURE 12Q 

Thr Pro lie Met Gin Lys Lys Val Gly Gin lie Met Leu lie Leu 
2440 2445 2450 

gtg tct eta get gca gta gta gtg aac ccg tct gtg aag aca gta 
7500 

Val Ser Leu Ala Ala Val Val Val Asn Pro Ser Val Lys Thr Val 
2455 2460 2465 

cga gaa gec gga att ttg ate acg gec gca gcg gtg acg ctt tgg 
7545 

Arg Glu Ala Gly lie Leu lie Thr Ala Ala Ala Val Thr Leu Trp 
2470 2475 2480 

gag aat gga gca age tct gtt tgg aac gca aca act gec ate gga 
7590 

Glu Asn Gly Ala Ser Ser Val Trp Asn Ala Thr Thr Ala lie Gly 
2485 2490 2495 

etc tgc cac ate atg cgt ggg ggt tgg ttg tea tgt eta tec ata 
7635 

Leu Cys His lie Met Arg Gly Gly Trp Leu Ser Cys Leu Ser lie 
2500 2505 2510 

aca tgg aca etc ata aag aac atg gaa aaa cca gga eta aaa aga 
7680 

Thr Trp Thr Leu lie Lys Asn Met Glu Lys Pro Gly Leu Lys Arg 
2515 2520 2525 

ggt ggg gca aaa gga cgc acc ttg gga gag gtt tgg aaa gaa aga

7725 

Gly Gly Ala Lys Gly Arg Thr Leu Gly Glu Val Trp Lys Glu Arg 
2530 2535 2540 

etc aac cag atg aca aaa gaa gag ttc act agg tac cgc aaa gag 
7770 

Leu Asn Gin Met Thr Lys Glu Glu Phe Thr Arg Tyr Arg Lys Glu 
2545 2550 2555 

gec ate ate gaa gtc gat cgc tea gcg gca aaa cac gee agg aaa 
7815 

Ala lie lie Glu Val Asp Arg Ser Ala Ala Lys His Ala Arg Lys 
2560 2565 2570 

gaa ggc aat gtc act gga ggg cat cca gtc tct agg ggc aca gca 
7860 

Glu Gly Asn Val Thr Gly Gly His Pro Val Ser Arg Gly Thr Ala 
2575 2580 2585 

aaa ctg aga tgg ctg gtc gaa egg agg ttt etc gaa ccg gtc gga 
7905 
FIGURE 12R

Lys Leu Arg Trp Leu Val Glu Arg Arg Phe Leu Glu Pro Val Gly
2590 2595 2600

aaa gtg att gac ctt gga tgt gga aga ggc ggt tgg tgt tac tat
7950

Lys Val Ile Asp Leu Gly Cys Gly Arg Gly Gly Trp Cys Tyr Tyr
2605 2610 2615

atg gca acc caa aaa aga gtc caa gaa gtc aga ggg tac aca aag
7995

Met Ala Thr Gln Lys Arg Val Gln Glu Val Arg Gly Tyr Thr Lys
2620 2625 2630

ggc ggt ccc gga cat gaa gag ccc caa cta gtg caa agt tat gga
8040

Gly Gly Pro Gly His Glu Glu Pro Gln Leu Val Gln Ser Tyr Gly
2635 2640 2645

tgg aac att gtc acc atg aag agt gga gtg gat gtg ttc tac aga
8085

Trp Asn Ile Val Thr Met Lys Ser Gly Val Asp Val Phe Tyr Arg
2650 2655 2660

cct tct gag tgt tgt gac acc ctc ctt tgt gac atc gga gag tcc
8130

Pro Ser Glu Cys Cys Asp Thr Leu Leu Cys Asp Ile Gly Glu Ser
2665 2670 2675

tcg tca agt gct gag gtt gaa gag cat agg acg att cgg gtc ctt
8175

Ser Ser Ser Ala Glu Val Glu Glu His Arg Thr Ile Arg Val Leu
2680 2685 2690

gaa atg gtt gag gac tgg ctg cac cga ggg cca agg gaa ttt tgc
8220

Glu Met Val Glu Asp Trp Leu His Arg Gly Pro Arg Glu Phe Cys
2695 2700 2705

gtg aag gtg ctc tgc ccc tac atg ccg aaa gtc ata gag aag atg
8265

Val Lys Val Leu Cys Pro Tyr Met Pro Lys Val Ile Glu Lys Met
2710 2715 2720

gag ctg ctc caa cgc cgg tat ggg ggg gga ctg gtc aga aac cca
8310

Glu Leu Leu Gln Arg Arg Tyr Gly Gly Gly Leu Val Arg Asn Pro
2725 2730 2735

ctc tca cgg aat tcc acg cac gag atg tat tgg gtg agt cga gct
8355
FIGURE 12S

Leu Ser Arg Asn Ser Thr His Glu Met Tyr Trp Val Ser Arg Ala
2740 2745 2750

tca ggc aat gtg gta cat tca gtg aat atg acc agc cag gtg ctc
8400

Ser Gly Asn Val Val His Ser Val Asn Met Thr Ser Gln Val Leu
2755 2760 2765

cta gga aga atg gaa aaa agg acc tgg aag gga ccc caa tac gag
8445

Leu Gly Arg Met Glu Lys Arg Thr Trp Lys Gly Pro Gln Tyr Glu
2770 2775 2780

gaa gat gta aac ttg gga agt gga acc agg gcg gtg gga aaa ccc
8490

Glu Asp Val Asn Leu Gly Ser Gly Thr Arg Ala Val Gly Lys Pro
2785 2790 2795

ctg ctc aac tca gac acc agt aaa atc aag aac agg att gaa cga
8535

Leu Leu Asn Ser Asp Thr Ser Lys Ile Lys Asn Arg Ile Glu Arg
2800 2805 2810

ctc agg cgt gag tac agt tcg acg tgg cac cac gat gag aac cac
8580

Leu Arg Arg Glu Tyr Ser Ser Thr Trp His His Asp Glu Asn His
2815 2820 2825

cca tat aga acc tgg aac tat cac ggc agt tat gat gtg aag ccc
8625

Pro Tyr Arg Thr Trp Asn Tyr His Gly Ser Tyr Asp Val Lys Pro
2830 2835 2840

aca ggc tcc gcc agt tcg ctg gtc aat gga gtg gtc agg ctc ctc
8670

Thr Gly Ser Ala Ser Ser Leu Val Asn Gly Val Val Arg Leu Leu
2845 2850 2855

tca aaa cca tgg gac acc atc acg aat gtt acc acc atg gcc atg
8715

Ser Lys Pro Trp Asp Thr Ile Thr Asn Val Thr Thr Met Ala Met
2860 2865 2870

act gac act act ccc ttc ggg cag cag cga gtg ttc aaa gag aag
8760

Thr Asp Thr Thr Pro Phe Gly Gln Gln Arg Val Phe Lys Glu Lys
2875 2880 2885

gtg gac acg aaa gct cct gaa ccg cca gaa gga gtg aag tac gtg
8805
FIGURE 12T

Val Asp Thr Lys Ala Pro Glu Pro Pro Glu Gly Val Lys Tyr Val
2890 2895 2900

ctc aat gag acc acc aac tgg ttg tgg gcg ttt ttg gcc aga gaa
8850

Leu Asn Glu Thr Thr Asn Trp Leu Trp Ala Phe Leu Ala Arg Glu
2905 2910 2915

aaa cgt ccc aga atg tgc tct cga gag gaa ttc ata aga aag gtc
8895

Lys Arg Pro Arg Met Cys Ser Arg Glu Glu Phe Ile Arg Lys Val
2920 2925 2930

aac agc aat gca gct ttg ggt gcc atg ttt gaa gag cag aat caa
8940

Asn Ser Asn Ala Ala Leu Gly Ala Met Phe Glu Glu Gln Asn Gln
2935 2940 2945

tgg agg agc gcc aga gaa gca gtt gaa gat cca aaa ttt tgg gag
8985

Trp Arg Ser Ala Arg Glu Ala Val Glu Asp Pro Lys Phe Trp Glu
2950 2955 2960

atg gtg gat gag gag cgc gag gca cat ctg cgg ggg gaa tgt cac
9030

Met Val Asp Glu Glu Arg Glu Ala His Leu Arg Gly Glu Cys His
2965 2970 2975

act tgc att tac aac atg atg gga aag aga gag aaa aaa ccc gga
9075

Thr Cys Ile Tyr Asn Met Met Gly Lys Arg Glu Lys Lys Pro Gly
2980 2985 2990

gag ttc gga aag gcc aag gga agc aga gcc att tgg ttc atg tgg
9120

Glu Phe Gly Lys Ala Lys Gly Ser Arg Ala Ile Trp Phe Met Trp
2995 3000 3005

ctc gga gct cgc ttt ctg gag ttc gag gct ctg ggt ttt ctc aat
9165

Leu Gly Ala Arg Phe Leu Glu Phe Glu Ala Leu Gly Phe Leu Asn
3010 3015 3020

gaa gac cac tgg ctt gga aga aag aac tca gga gga ggt gtc gag
9210

Glu Asp His Trp Leu Gly Arg Lys Asn Ser Gly Gly Gly Val Glu
3025 3030 3035

ggc ttg ggc ctc caa aaa ctg ggt tac atc ctg cgt gaa gtt ggc
9255
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FIGURE 12U 

Gly Leu Gly Leu Gin Lys Leu Gly Tyr lie Leu Arg Glu Val Gly 
3040 3045 3050

acc egg cct ggg ggc aag ate tat get gat gac aca get ggc tgg 
9300 

Thr Arg Pro Gly Gly Lys lie Tyr Ala Asp Asp Thr Ala Gly Trp 
3055 3060 3065 

gac acc cgc ate acg aga get gac ttg gaa aat gaa get aag gtg 
9345 

Asp Thr Arg lie Thr Arg Ala Asp Leu Glu Asn Glu Ala Lys Val 
3070 3075 3080 

ctt gag ctg ctt gat ggg gaa cat egg cgt ctt gee agg gee ate 
9390 

Leu Glu Leu Leu Asp Gly Glu His Arg Arg Leu Ala Arg Ala lie 
3085 3090 3095 

att gag etc acc tat cgt cac aaa gtt gtg aaa gtg atg cgc ccg 
9435 

lie Glu Leu Thr Tyr Arg His Lys Val Val Lys Val Met Arg Pro 
3100 3105 3110 

get get gat gga aga acc gtc atg gat gtt ate tec aga gaa gat 
9480 

Ala Ala Asp Gly Arg Thr Val Met Asp Val lie Ser Arg Glu Asp 
3115 3120 3125 

cag agg ggg agt gga caa gtt gtc acc tac gee eta aac act ttc 
9525 

Gin Arg Gly Ser Gly Gin Val Val Thr Tyr Ala Leu Asn Thr Phe 
3130 3135 3140 

acc aac ctg gee gtc cag ctg gtg agg atg atg gaa ggg gaa gga 
9570 

Thr Asn Leu Ala Val Gin Leu Val Arg Met Met Glu Gly Glu Gly 
3145 3150 3155 

gtg att ggc cca gat gat gtg gag aaa etc aca aaa ggg aaa gga 
9615 

Val lie Gly Pro Asp Asp Val Glu Lys Leu Thr Lys Gly Lys Gly 
3160 3165 3170 

ccc aaa gtc agg acc tgg ctg ttt gag aat ggg gaa gaa aga etc 
9660 

Pro Lys Val Arg Thr Trp Leu Phe Glu Asn Gly Glu Glu Arg Leu 
3175 3180 3185 

age cgc atg get gtc agt gga gat gac tgt gtg gta aag ccc ctg 
9705 
FIGURE 12V

Ser Arg Met Ala Val Ser Gly Asp Asp Cys Val Val Lys Pro Leu
3190 3195 3200

gac gat cgc ttt gcc acc tcg ctc cac ttc ctc aat gct atg tca
9750

Asp Asp Arg Phe Ala Thr Ser Leu His Phe Leu Asn Ala Met Ser
3205 3210 3215

aag gtt cgc aaa gac atc caa gag tgg aaa ccg tca act gga tgg
9795

Lys Val Arg Lys Asp Ile Gln Glu Trp Lys Pro Ser Thr Gly Trp
3220 3225 3230

tat gat tgg cag cag gtt cca ttt tgc tca aac cat ttc act gaa
9840

Tyr Asp Trp Gln Gln Val Pro Phe Cys Ser Asn His Phe Thr Glu
3235 3240 3245

ttg atc atg aaa gat gga aga aca ctg gtg gtt cca tgc cga gga
9885

Leu Ile Met Lys Asp Gly Arg Thr Leu Val Val Pro Cys Arg Gly
3250 3255 3260

cag gat gaa ttg gta ggc aga gct cgc ata tct cca ggg gcc gga
9930

Gln Asp Glu Leu Val Gly Arg Ala Arg Ile Ser Pro Gly Ala Gly
3265 3270 3275

tgg aac gtc cgc gac act gct tgt ctg gct aag tct tat gcc cag
9975

Trp Asn Val Arg Asp Thr Ala Cys Leu Ala Lys Ser Tyr Ala Gln
3280 3285 3290

atg tgg ctg ctt ctg tac ttc cac aga aga gac ctg cgg ctc atg
10020

Met Trp Leu Leu Leu Tyr Phe His Arg Arg Asp Leu Arg Leu Met
3295 3300 3305

gcc aac gcc att tgc tcc gct gtc cct gtg aat tgg gtc cct acc
10065

Ala Asn Ala Ile Cys Ser Ala Val Pro Val Asn Trp Val Pro Thr
3310 3315 3320

gga aga acc acg tgg tcc atc cat gca gga gga gag tgg atg aca
10110

Gly Arg Thr Thr Trp Ser Ile His Ala Gly Gly Glu Trp Met Thr
3325 3330 3335

aca gag gac atg ttg gag gtc tgg aac cgt gtt tgg ata gag gag
10155
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FIGURE 12W 

Thr Glu Asp Met Leu Glu Val Trp Asn Arg Val Trp He Glu Glu 
3340 3345 3350 

aat gaa tgg atg gaa gac aaa acc cca gtg gag aaa tgg agt gac 
10200 

Asn Glu Trp Met Glu Asp Lys Thr Pro Val Glu Lys Trp Ser Asp 
3355 3360 3365 

gtc cca tat tea gga aaa cga gag gac ate tgg tgt ggc age ctg 
10245 

Val Pro Tyr Ser Gly Lys Arg Glu Asp He Trp Cys Gly Ser Leu 
3370 3375 3380 

att ggc aca aga gec cga gec acg tgg gca gaa aac ate cag gtg 
10290 

He Gly Thr Arg Ala Arg Ala Thr Trp Ala Glu Asn He Gin Val 
3385 3390 3395 

get ate aac caa gtc aga gca ate ate gga gat gag aag tat gtg 
10335 

Ala He Asn Gin Val Arg Ala He He Gly Asp Glu Lys Tyr Val 
3400 3405 3410 

gat tac atg agt tea eta aag aga tat gaa gac aca act ttg gtt 
10380 

Asp Tyr Met Ser Ser Leu Lys Arg Tyr Glu Asp Thr Thr Leu Val 
3415 3420 3425 

gag gac aca gta ctg tag atatttaatc aattgtaaat agacaatata 
10428 

Glu Asp Thr Val Leu 
3430 

agtatgcata aaagtgtagt tttatagtag tatttagtgg tgttagtgta aatagttaag 
10488 

aaaattttga ggagaaagtc aggccgggaa gttcccgcca ccggaagttg agtagacggt 
10548 

gctgcctgcg actcaacccc aggaggactg ggtgaacaaa gccgcgaagt gatccatgta 
10608 

agccctcaga accgtctcgg aaggaggacc ccacatgttg taacttcaaa gcccaatgtc 
10668 

agaccacgct acggcgtgct actctgcgga gagtgcagtc tgcgatagtg ccccaggagg 
10728 

actgggttaa caaaggcaaa ccaacgcccc acgcggccct agccccggta atggtgttaa 
10788 
FIGURE 12X 

ccagggcgaa aggactagag gttagaggag accccgcggt ttaaagtgca cggcccagcc 
10848 

tgactgaagc tgtaggtcag gggaaggact agaggttagt ggagaccccg tgccacaaaa 
10908 

caccacaaca aaacagcata ttgacacctg ggatagacta ggagatcttc tgctctgcac 

10968 

aaccagccac acggcacagt gcgccgacaa tggtggctgg tggtgcgaga acacaggatc 
11028 

t 

11029 






Figure 13. 5 kb C5 locus and PCR primers to amplify C5 arms (SEQ ID NO: 66) 



CTGAAATTGTAATTTCTACATGTAGAGAAGGTTTTGATATTGATGGTTTTAACAGAAACG 
GACTTTAACATTAAAGATGTACATCTCTTCCAAAACTATAACTACCAAAATTGTCTTTGC 
10 20 30 40 50 60 



TAGAAATTATATCAAGGGATAACATTTTATATGATATAGTTTTAAAGTGTAAGATGGAAT 
ATCTTTAATATAGTTCCCTATTGTAAAATATACTATATCAAAATTTCACATTCTACCTTA 
70 80 90 100 110 120 



TAAATTTCATGTGCACAAGAGGCATAGGAGATAAAAGCATTTTCAGACTTTGTATAATGA 
ATTTAAAGTACACGTGTTCTCCGTATCCTCTATTTTCGTAAAAGTCTGAAACATATTACT 
130 140 150 160 170 180 

AGGAATATGATCAAATAAACAAGAATCTGTTAGTTAGTTACTTGGATAAATTAATCGAGA 
TCCTTATACTAGTTTATTTGTTCTTAGACAATCAATCAATGAACCTATTTAATTAGCTCT 
190 200 210 220 230 240 



CGCGTGATAAAATGACTATGTACCGTTATTGCATGAACGATATTATAAATATAGGTTCTC 
GCGCACTATTTTACTGATACATGGCAATAACGTACTTGCTATAATATTTATATCCAAGAG 
250 260 270 280 290 300 

(C5A1) GGCCGAATTC 
GTAGGAGAGAACTATTGACTATGGCAATGAATGTTAAATGTTATACTTTGGATGAAGCTA 
CATCCTCTCTTGATAACTGATACCGTTACTTACAATTTACAATATGAAACCTACTTCGAT 
310 320 330 340 350 360 



TAAATATGCATTGGAAAAATAATCCATTTAAAGAAAGGATTCAAATACTACAAAACCTAA 
ATTTATACGTAACCTTTTTATTAGGTAAATTTCTTTCCTAAGTTTATGATGTTTTGGATT 
370 380 390 400 410 420 



GCGATAATATGTTAACTAAGCTTATTCTTAACGACGCTTTAAATATACACAAATAAACAT 
CGCTATTATACAATTGATTCGAATAAGAATTGCTGCGAAATTTATATGTGTTTATTTGTA 
430 440 450 460 470 480 



AATTTTTGTATAACCTAACAAATAACTAAAACATAAAAATAATAAAAGGAAATGTAATAT 
TTAAAAACATATTGGATTGTTTATTGATTTTGTATTTTTATTATTTTCCTTTACATTATA 
490 500 510 520 530 540 






CGTAATTATTTTACTCAGGAATGGGGTTAAATATTTATATCACGTGTATATCTATACTGT 
GCATTAATAAAATGAGTCCTTACCCCAATTTATAAATATAGTGCACATATAGATATGACA 
550 560 570 580 590 600 



TATCGTATACTCTTTACAATTACTATTACGAATATGCAAGAGATAATAAGATTACGTATT 
ATAGCATATGAGAAATGTTAATGATAATGCTTATACGTTCTCTATTATTCTAATGCATAA 
610 620 630 640 650 660 



TAAGAGAATCTTGTCATGATAATTGGGTACGACATAGTGATAAATGCTATTTCGCATCGT 
ATTCTCTTAGAACAGTACTATTAACCCATGCTGTATCACTATTTACGATAAAGCGTAGCA 
670 680 690 700 710 720 

TACATAAAGTCAGTTGGAAAGATGGATTTGACAGATGTAACTTAATAGGTGCAAAAATGT 
ATGTATTTCAGTCAACCTTTCTACCTAAACTGTCTACATTGAATTATCCACGTTTTTACA 
730 740 750 760 770 780 



TAAATAACAGCATTCTATCGGAAGATAGGATACCAGTTATATTATACAAAAATCACTGGT 
ATTTATTGTCGTAAGATAGCCTTCTATCCTATGGTCAATATAATATGTTTTTAGTGAGCA 
790 800 810 820 830 840 



TGGATAAAACAGATTCTGCAATATTCGTAAAAGATGAAGATTACTGCGAATTTGTAAACT 
ACCTATTTTGTCTAAGACGTTATAAGCATTTTCTACTTCTAATGACGCTTAAACATTTGA 
850 860 870 880 890 900 



ATGACAATAAAAAGCCATTTATCTCAACGACATCGTGTAATTCTTCCATGTTTTATGTAT 
TACTGTTATTTTTCGGTAAATAGAGTTGCTGTAGCACATTAAGAAGGTACAAAATACATA 
910 920 930 940 950 960 



GTGTTTCAGATATTATGAGATTACTATAAACTTTTTGTATACTTATATTCCGTAAACTAT 
CACAAAGTCTATAATACTCTAATGATATTTGAAAAACATATGAATATAAGGCATTTGATA 
970 980 990 1000 1010 1020 



ATTAATCATGAAGAAAATGAAAAAGTATAGAAGCTGTTCACGAGCGGTTGTTGAAAACAA 
TAATTAGTACTTCTTTTACTTTTTCATATCTTCGACAAGTGCTCGCCAACAACTTTTGTT 
1030 1040 1050 1060 1070 1080 



CAAAATTATACATTCAAGATGGCTTACATATACGTCTGTGAGGCTATCATGGATAATGAC 
GTTTTAATATGTAAGTTCTACCGAATGTATATGCAGACACTCCGATAGTACCTATTACTG 
1090 1100 1110 1120 1130 1140 
AATGCATCTCTAAATAGGTTTTTGGACAATGGATTCGACCCTAACACGGAATATGGTACT 
TTACGTAGAGATTTATCCAAAAACCTGTTACCTAAGCTGGGATTGTGCCTTATACCATGA 
1150 1160 1170 1180 1190 1200 

CTACAATCTCCTCTTGAAATGGCTGTAATGTTCAAGAATACCGAGGCTATAAAAATCTTG 
GATGTTAGAGGAGAACTTTACCGACATTACAAGTTCTTATGGCTCCGATATTTTTAGAAC 
1210 1220 1230 1240 1250 1260 

ATGAGGTATGGAGCTAAACCTGTAGTTACTGAATGCACAACTTCTTGTCTGCATGATGCG 
TACTCCATACCTCGATTTGGACATCAATGACTTACGTGTTGAAGAACAGACGTACTACGC 
1270 1280 1290 1300 1310 1320 



GTGTTGAGAGACGACTACAAAATAGTGAAAGATCTGTTGAAGAATAACTATGTAAACAAT 
CACAACTCTCTGCTGATGTTTTATCACTTTCTAGACAACTTCTTATTGATACATTTGTTA 
1330 1340 1350 1360 1370 1380 



GTTCTTTACAGCGGAGGCTTTACTCCTTTGTGTTTGGCAGCTTACCTTAACAAAGTTAAT 
CAAGAAATGTCGCCTCCGAAATGAGGAAACACAAACCGTCGAATGGAATTGTTTCAATTA 
1390 1400 1410 1420 1430 1440 

TTGGTTAAACTTCTATTGGCTCATTCGGCGGATGTAGATATTTCAAACACGGATCGGTTA 
AACCAATTTGAAGATAACCGAGTAAGCCGCCTACATCTATAAAGTTTGTGCCTAGCCAAT 
1450 1460 1470 1480 1490 1500 

ACTCCTCTACATATAGCCGTATCAAATAAAAATTTAACAATGGTTAAACTTCTATTGAAC 
TGAGGAGATGTATATCGGCATAGTTTATTTTTAAATTGTTACCAATTTGAAGATAACTTG 
1510 1520 1530 1540 1550 1560 

AAAGGTGCTGATACTGACTTGCTGGATAACATGGGACGTACTCCTTTAATGATCGCTGTA 
TTTCCACGACTATGACTGAACGACCTATTGTACCCTGCATGAGGAAATTACTAGCGACAT 
1570 1580 1590 1600 1610 1620 



CAATCTGGAAATATTGAAATATGTAGCACACTACTTAAAAAAAATAAAATGTCCAGAACT 
GTTAGACCTTTATAACTTTATACATCGTGTGATGAATTTTTTTTATTTTACAGGTCTTGA 
1630 1640 1650 1660 1670 1680 

GGGAAAAATTGATCTTGCCAGCTGTAATTCATGGTAGAAAAGAAGTGCTCAGGCTACTTT 
CCCTTTTTAACTAGAACGGTCGACATTAAGTACCATCTTTTCTTCACGAGTCCGATGAAA 
1690 1700 1710 1720 1730 1740 
TCAACAAAGGAGCAGATGTAAACTACATCTTTGAAAGAAATGGAAAATCATATACTGTTT 
AGTTGTTTCCTCGTCTACATTTGATGTAGAAACTTTCTTTACCTTTTAGTATATGACAAA 
1750 1760 1770 1780 1790 1800 



TGGAATTGATTAAAGAAAGTTACTCTGAGACACAAAAGAGGTAGCTGAAGTGGTACTCTC 
ACCTTAACTAATTTCTTTCAATGAGACTCTGTGTTTTCTCCATCGACTTCACCATGAGAG 
1810 1820 1830 1840 1850 1860 



C5 ORF 
MET GLN ASN ASP ASP CYS GLU ALA ARG SER ARG GLU ILE THR LEU TYR ASP PHE LEU 
AAAATGCAGAACGATGACTGCGAAGCAAGAAGTAGAGAAATAACACTTTATGACTTTCTT 
TTTTACGTCTTGCTACTGACGCTTCGTTCTTCATCTCTTTATTGTGAAATACTGAAAGAA 
1870 1880 1890 1900 1910 1920 



CCATGCACT GA_T T A AT C G A[T AT TT T Tj |C C AT G G C C C (C5B1) 



SER CYS ARG LYS ASP ARG ASP ILE MET MET VAL ILE ASN ASN SER ASP ILE ALA SER LYS 
AGTTGTAGAAAAGATAGAGATATAATGATGGTCATAAATAACTCTGATATTGCAAGTAAA 
TCAACATCTTTTCTATCTCTATATTACTACCAGTATTTATTGAGACTATAACGTTCATTT 
1930 1940 1950 1960 1970 1980 



CYS ASN ASN LYS LEU ASP LEU PHE LYS ARG 
TGCAATAATAAGTTAGATTTATTTAAAAGG 
ACGTTATTATTCAATCTAAATAAATTTTCC 
1990 2000 2010 



ILE VAL LYS ASN ARG LYS LYS GLU LEU ILE 
ATAGTTAAAAATAGAAAAAAAGAGTTAATT 
TATCAATTTTTATCTTTTTTTCTCAATTAA 
2020 2030 2040 



CYS ARG VAL LYS ILE ILE HIS LYS ILE LEU 
TGTAGGGTTAAAATAATACATAAGATCTTA 
ACATCCCAATTTTATTATGTATTCTAGAAT 
2050 2060 2070 



LYS PHE ILE ASN THR HIS ASN ASN LYS ASN 
AAATTTATAAATACGCATAATAATAAAAAT 
TTTAAATATTTATGCGTATTATTATTTTTA 
2080 2090 2100 



(C5C1) iGGATCqCGGGfTTTT TAjGAC T_AG TTAATC ACGGCCQ 
ARG LEU TYR LEU LEU PRO SER GLU ILE LYS PHE LYS ILE PHE THR TYR LEU THR TYR LYS 
AGATTATACTTATTACCTTCAGAGATAAAATTTAAGATATTTACTTATTTAACTTATAAA 
TCTAATATGAATAATGGAAGTCTCTATTTTAAATTCTATAAATGAATAAATTGAATATTT 
2110 2120 2130 2140 2150 2160 



ASP LEU LYS CYS ILE ILE SER LYS *** 
GATCTAAAATGCATAATTTCTAAATAATGAAAAAAAGTACATCATGAGCAACGCGTTAGT 
CTAGATTTTACGTATTAAAGATTTATTACTTTTTTTCATGTAGTACTCGTTGCGCAATCA 
2170 2180 2190 2200 2210 2220 



ATATTTTACAATGGAGATTAACGCTCTATACCGTTCTATGTTTATTGATTCAGATGATGT 
TATAAAATGTTACCTCTAATTGCGAGATATGGCAAGATACAAATAACTAAGTCTACTACA 
2230 2240 2250 2260 2270 2280 



TTTAGAAAAGAAAGTTATTGAATATGAAAACTTTAATGAAGATGAAGATGACGACGATGA 
AAATCTTTTCTTTCAATAACTTATACTTTTGAAATTACTTCTACTTCTACTGCTGCTACT 
2290 2300 2310 2320 2330 2340 

TTATTGTTGTAAATCTGTTTTAGATGAAGAAGATGACGCGCTAAAGTATACTATGGTTAC 
AATAACAACATTTAGACAAAATCTACTTCTTCTACTGCGCGATTTCATATGATACCAATG 
2350 2360 2370 2380 2390 2400 

AAAGTATAAGTCTATACTACTAATGGCGACTTGTGCAAGAAGGTATAGTATAGTGAAAAT 
TTTCATATTCAGATATGATGATTACCGCTGAACACGTTCTTCCATATCATATCACTTTTA 
2410 2420 2430 2440 2450 2460 

GTTGTTAGATTATGATTATGAAAAACCAAATAAATCAGATCCATATCTAAAGGTATCTCC 
CAACAATCTAATACTAATACTTTTTGGTTTATTTAGTCTAGGTATAGATTTCCATAGAGG 
2470 2480 2490 2500 2510 2520 

TTTGCACATAATTTCATCTATTCCTAGTTTAGAATACTTTTCATTATATTTGTTTACAGC 
AAACGTGTATTAAAGTAGATAAGGATCAAATCTTATGAAAAGTAATATAAACAAATGTCG 
2530 2540 2550 2560 2570 2580 
(C5D1) 
gacgtcgg 



TGAAGACGAAAAAAATATATCGATAATAGAAGATTATGTTAACTCTGCTAATAAGATGAA 
ACTTCTGCTTTTTTTATATAGCTATTATCTTCTAATACAATTGAGACGATTATTCTACTT 
2590 2600 2610 2620 2630 2640 



ATTGAATGAGTCTGTGATAATAGCTATAATCAGAGAAGTTCTAAAAGGAAATAAAAATCT 
TAACTTACTCAGACACTATTATCGATATTAGTCTCTTCAAGATTTTCCTTTATTTTTAGA 
2650 2660 2670 2680 2690 2700 



AACTGATCAGGATATAAAAACATTGGCTGATGAAATCAACAAGGAGGAACTGAATATAGC 
TTGACTAGTCCTATATTTTTGTAACCGACTACTTTAGTTGTTCCTCCTTGACTTATATCG 
2710 2720 2730 2740 2750 2760 

TAAACTATTGTTAGATAGAGGGGCCAAAGTAAATTACAAGGATGTTTACGGTTCTTCAGC 
ATTTGATAACAATCTATCTCCCCGGTTTCATTTAATGTTCCTACAAATGCCAAGAAGTCG 
2770 2780 2790 2800 2810 2820 

TCTCCATAGAGCTGCTATTGGTAGGAAACAGGATATGATAAAGCTGTTAATCGATCATGG 
AGAGGTATCTCGACGATAACCATCCTTTGTCCTATACTATTTCGACAATTAGCTAGTACC 
2830 2840 2850 2860 2870 2880 



AGCTGATGTAAACTCTTTAACTATTGCTAAAGATAATCTTATTAAAAAAAAATAATATCA 
TCGACTACATTTGAGAAATTGATAACGATTTCTATTAGAATAATTTTTTTTTATTATAGT 
2890 2900 2910 2920 2930 2940 



CGTTTAGTAATATTAAAATATATTAATAACTCTATTACTAATAACTCCAGTGGATATGAA 
GCAAATCATTATAATTTTATATAATTATTGAGATAATGATTATTGAGGTCACCTATACTT 
2950 2960 2970 2980 2990 3000 



CATAATACGAAGTTTATACATTCTCATCAAAATCTTATTGACATCAAGTTAGATTGTGAA 
GTATTATGCTTCAAATATGTAAGAGTAGTTTTAGAATAACTGTAGTTCAATCTAACACTT 
3010 3020 3030 3040 3050 3060 



AATGAGATTATGAAATTAAGGAATACAAAAATAGGATGTAAGAACTTACTAGAATGTTTT 
TTACTCTAATACTTTAATTCCTTATGTTTTTATCCTACATTCTTGAATGATCTTACAAAA 
3070 3080 3090 3100 3110 3120 



ATCAATAATGATATGAATACAGTATCTAGGGCTATAAACAATGAAACGATTAAAAATTAT 
TAGTTATTACTATACTTATGTCATAGATCCCGATATTTGTTACTTTGCTAATTTTTAATA 
3130 3140 3150 3160 3170 3180 



AAAAATCATTTCCCTATATATAATACGCTCATAGAAAAATTCATTTCTGAAAGTATACTA 
TTTTTAGTAAAGGGATATATATTATGCGAGTATCTTTTTAAGTAAAGACTTTCATATGAT 
3190 3200 3210 3220 3230 3240 



AGACACGAATTATTGGATGGAGTTATAAATTCTTTTCAAGGATTCAATAATAAATTGCCT 
TCTGTGCTTAATAACCTACCTCAATATTTAAGAAAAGTTCCTAAGTTATTATTTAACGGA 
3250 3260 3270 3280 3290 3300 



TACGAGATTCAGTACATTATACTGGAGAATCTTAATAACCATGAACTAAAAAAAATTTTA 
ATGCTCTAAGTCATGTAATATGACCTCTTAGAATTATTGGTACTTGATTTTTTTTAAAAT 
3310 3320 3330 3340 3350 3360 



GATAATATACATTAAAAAGGTAAATAGATCATCTGTTATTATAAGCAAAGATGCTTGTTG 
CTATTATATGTAATTTTTCCATTTATCTAGTAGACAATAATATTCGTTTCTACGAACAAC 
3370 3380 3390 3400 3410 3420 



CCAATAATATACAACAGGTATTTGTTTTTATTTTTAACTACATATTTGATGTTCATTCTC 
GGTTATTATATGTTGTCCATAAACAAAAATAAAAATTGATGTATAAACTACAAGTAAGAG 
3430 3440 3450 3460 3470 3480 
TTTATATAGTATACACAGAAAATTCATAATCCACTTAGAATTTCTAGTTATCTAGTTTTT 
AAATATATCATATGTGTCTTTTAAGTATTAGGTGAATCTTAAAGATCAATAGATCAAAAA 
3490 3500 3510 3520 3530 3540 



CTAGAATATTGTACTTTATTTCTAATGGAATGGCTCTCCAGCCTAGTAATTTATTAATGT 
GATCTTATAACATGAAATAAAGATTACCTTACCGAGAGGTCGGATCATTAAATAATTACA 
3550 3560 3570 3580 3590 3600 



TAGCTGATATCTTGAAATCAGGATATTCTGCTCCGTGAAGAGAAAGTCCTCCAAAGTTGT 
ATCGACTATAGAACTTTAGTCCTATAAGACGAGGCACTTCTCTTTCAGGAGGTTTCAACA 
3610 3620 3630 3640 3650 3660 



ATATTTCCATCACTTTCATGGCTTCCTCTTTCCATAGTGTCTTCTATAAGCTGTCTATAT 
TATAAAGGTAGTGAAAGTACCGAAGGAGAAAGGTATCACAGAAGATATTCGACAGATATA 
3670 3680 3690 3700 3710 3720 



ATTGTAAACTTTTCTGGTTTTATGCATTTTAAACATTTAGCAATCTCATTTTCATCACAA 
TAACATTTGAAAAGACCAAAATACGTAAAATTTGTAAATCGTTAGAGTAAAAGTAGTGTT 
3730 3740 3750 3760 3770 3780 



TTAAGGCACAAATCTAACATGGAATGTCTACCATAACCCAATAAGGTTTTTTCATTTCCT 
AATTCCGTGTTTAGATTGTACCTTACAGATGGTATTGGGTTATTCCAAAAAAGTAAAGGA 
3790 3800 3810 3820 3830 3840 



CTATCTCTAATACACACTGTTCTTTCCAGACTTTCAACACGCTGCTATTTTCTATTTTAT 
GATAGAGATTATGTGTGACAAGAAAGGTCTGAAAGTTGTGCGACGATAAAAGATAAAATA 
3850 3860 3870 3880 3890 3900 



TCAAGTCCATATTATAAGCGTCCTTGTTAGACACTTCATAATGTTTGCATTCTGGAATCA 
AGTTCAGGTATAATATTCGCAGGAACAATCTGTGAAGTATTACAAACGTAAGACCTTAGT 
3910 3920 3930 3940 3950 3960 



TCATGTTAGATACTATTAATTTAGCTACTTCTATGTTGTCATCAAAAGAGTTGCTATCTG 
AGTACAATCTATGATAATTAAATCGATGAAGATACAACAGTAGTTTTCTCAACGATAGAC 
3970 3980 3990 4000 4010 4020 



TAATTACACTAAGAGGTGTATCACCTGATAAAGAAGTAATAGAGACATCTGCTCTGAATT 
ATTAATGTGATTCTCCACATAGTGGACTATTTCTTCATTATCTCTGTAGACGAGACTTAA 
4030 4040 4050 4060 4070 4080 
TAAGCAATACCTCAATAACTTCTTTTGAAGATGACTTTGCAGCTAAAAATAATGGAGTTC 
ATTCGTTATGGAGTTATTGAAGAAAACTTCTACTGAAACGTCGATTTTTATTACCTCAAG 
4090 4100 4110 4120 4130 4140 



TCTCTAAAACATCCCTTGAGTTTATATTAGCCCCGTAACTAATGAGTAGTTCTGTTATAT 
AGAGATTTTGTAGGGAACTCAAATATAATCGGGGCATTGATTACTCATCAAGACAATATA 
4150 4160 4170 4180 4190 4200 



CTTTGGAATCTATTGATATATTATTAATTACAATTGTCATGCTGACATATATAGACATCA 
GAAACCTTAGATAACTATATAATAATTAATGTTAACAGTACGACTGTATATATCTGTAGT 
4210 4220 4230 4240 4250 4260 



TAATATGATGAAAAATATGAAAATATAAGTGCACGTTTACTGTTACTATGATTGTGATAT 
ATTATACTACTTTTTATACTTTTATATTCACGTGCAAATGACAATGATACTAACACTATA 
4270 4280 4290 4300 4310 4320 



CGATATGAGTTCTTTAATAAAAGTACTGAAATAGATATAATGCAGATATGATTGATATTT 
GCTATACTCAAGAAATTATTTTCATGACTTTATCTATATTACGTCTATACTAACTATAAA 
4330 4340 4350 4360 4370 4380 



TAAAAAGTTGAAAAAAAATATGCCCTGTTTACAAATACTATTTGGAAATATTCTGTAATA 
ATTTTTCAACTTTTTTTTATACGGGACAAATGTTTATGATAAACCTTTATAAGACATTAT 
4390 4400 4410 4420 4430 4440 



AAGTAATAGTGATATGTCAGTCACGATGGATTTGCCAATTGATCATATGAGTATAGATAA 
TTCATTATCACTATACAGTCAGTGCTACCTAAACGGTTAACTAGTATACTCATATCTATT 
4450 4460 4470 4480 4490 4500 



CATAAACGAGTATAATAAAAATGGATATACTAGACTCTATATAGAGGTAGCCATGAAAAA 
GTATTTGCTCATATTATTTTTACCTATATGATCTGAGATATATCTCCATCGGTACTTTTT 
4510 4520 4530 4540 4550 4560 



ACGTAAAAACGTAGATAGACTTTTATATCTCGGAGCTGATCCGAATCTGGCTAGTGTAGA 
TGCATTTTTGCATCTATCTGAAAATATAGAGCCTCGACTAGGCTTAGACCGATCACATCT 
4570 4580 4590 4600 4610 4620 



TTCGTATTGTCCTCTTCATATTGCTGTTAGGAATGGTAGTTTAAAGATAATAAGATCATT 
AAGCATAACAGGAGAAGTATAACGACAATCCTTACCATCAAATTTCTATTATTCTAGTAA 
4630 4640 4650 4660 4670 4680 
GTTGAAATATGGTGCTAATATAAATCAAGAATGTCATGAAGGAGATACTGCTTTGATGAT 
CAACTTTATACCACGATTATATTTAGTTCTTACAGTACTTCCTCTATGACGAAACTACTA 
4690 4700 4710 4720 4730 4740 



GGCTATATCATTAGGTAATTATACAGCATGTAAAACACTTCTAGATAACAACGCCGATCC 
CCGATATAGTAATCCATTAATATGTCGTACATTTTGTGAAGATCTATTGTTGCGGCTAGG 
4750 4760 4770 4780 4790 4800 



TAATTATGTTAACTATTACGGTATAGTTCCGCTTATTAGAGCAATTATATGTGAAAAGCC 
ATTAATACAATTGATAATGCCATATCAAGGCGAATAATCTCGTTAATATACACTTTTCGG 
4810 4820 4830 4840 4850 4860 

TGACATAGTTAGACTGCTATTAGATAGAGGAGCTAATTGCAACCACTTAATTACAAAAAA 
ACTGTATCAATCTGACGATAATCTATCTCCTCGATTAACGTTGGTGAATTAATGTTTTTT 
4870 4880 4890 4900 4910 4920 



CGGTAGAACCTATACTGCTTTAGAGAGTCTTAGGAATTGCTTTTTTAAAGACAATTCTTC 
GGCATCTTGGATATGACGAAATCTCTCAGAATCCTTAACGAAAAAATTTCTGTTAAGAAG 
4930 4940 4950 4960 4970 4980 



ATCATTGTCGATACTAATAT 
TAGTAACAGCTATGATTATA 
4990 5000 
Figure 14. VQH6 amplified fragment (SEQ ID NO: 76) 

VQ marker 



BamH I 



V 



R Q G 



N 



AAAGGATCCGGGTTAATTAATTAGTCATCAGGCAGGGCGAGAACGAGACT 



TTTCCTAGGCCCAATTAATTAATCAGTAGTCCGTCCCGCTCTTGCTCTGA 



ICS-*** => H6p 

ATCTGCTCGT TAA T TAA T TAG AGCTTCTTTATTCTATACTTAAAAAGTGA 
TAGACGAGCAATTAATTAATCTCGAAGAAATAAGATATGAATTTTTCACT 



AAATAAATACAAAGGTTCTTGAGGGTTGTGTTAAATTGAAAGCGAGAAAT 
TTTATTTATGTTTCCAAGAACTCCCAACACAATTTAACTTTCACTCTTTA 



EcoR V 

aatcataaattatttcattatcgc |gatatc| cgttaagtttgtatcgtagg 
ttagtatttaataaagtaatagcgctataggcaattcaaacatagcat|cc 



Kpn I Xho I Xba I Cla I Sma I 

tacc |ctcgag| tctaga |atcgat| cccgggtttt 

ATGG|GAGCTC|AGATCT|TAGCTA|GGGCCC|AAAA 



Figure 15. Restriction map of pNVQH6C5LSP-18 




